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Preface 


The purpose of this book is to provide an introduction to principles of 
probability, random variables, and random processes and their applications. 


The book is designed for students in various disciplines of engineering, 
science, mathematics, and management. It may be used as a textbook and/or as 
a supplement to all current comparable texts. It should also be useful to those 
interested in the field for self-study. The book combines the advantages of both 
the textbook and the so-called review book. It provides the textual explanations 
of the textbook, and in the direct way characteristic of the review book, it gives 
hundreds of completely solved problems that use essential theory and 
techniques. Moreover, the solved problems are an integral part of the text. The 
background required to study the book is one year of calculus, elementary 
differential equations, matrix analysis, and some signal and system theory, 
including Fourier transforms. 


I wish to thank Dr. Gordon Silverman for his invaluable suggestions and 
critical review of the manuscript. I also wish to express my appreciation to the 
editorial staff of the McGraw-Hill Schaum Series for their care, cooperation, 
and attention devoted to the preparation of the book. Finally, I thank my wife, 
Daisy, for her patience and encouragement. 
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Chapter 1 


Probability 


11 INTRODUCTION 


The study of probability stems from the analysis of certain games of chance, and it has found 
applications in most branches of science and engineering. In this chapter the basic concepts of prob- 
ability theory are presented. 


12 SAMPLE SPACE AND EVENTS 
A. Random Experiments: 


In the study of probability, any process of observation is referred to as an experiment. The results 
of an observation are called the outcomes of the experiment. An experiment is called a random experi- 
ment if its outcome cannot be predicted. Typical examples of a random experiment are the roll of a 
die, the toss of a coin, drawing a card from a deck, or selecting a message signal for transmission from 
several messages. 


B. Sample Space: 

The set of all possible outcomes of a random experiment is called the sample space (or universal 
set), and it is denoted by S. An element in S is called a sample point. Each outcome of a random 
experiment corresponds to a sample point. 

EXAMPLE 1.1 Find the sample space for the experiment of tossing a coin (a) once and (b) twice. 
(a) There are two possible outcomes, heads or tails. Thus 
S = (H, T) 
where H and T represent head and tail, respectively. 
(b) There are four possible outcomes. They are pairs of heads and tails. Thus 
S = (HH, HT, TH, TT} 
EXAMPLE 1.2 Find the sample space for the experiment of tossing a coin repeatedly and of counting the number 
of tosses required until the first head appears. 
Clearly all possible outcomes for this experiment are the terms of the sequence 1, 2, 3, ... . Thus 
$2(,2, 3...) 


Note that there are an infinite number of outcomes. 


EXAMPLE 1.3 Find the sample space for the experiment of measuring (in hours) the lifetime of a transistor. 
Clearly all possible outcomes are all nonnegative real numbers. That is, 
S={:0<1< оо} 


where т represents the life of a transistor in hours. 


Note that any particular experiment can often have many different sample spaces depending on the observ- 
ation of interest (Probs. 1.1 and 1.2). A sample space S is said to be discrete if it consists of a finite number of 
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sample points (as in Example 1.1) or countably infinite sample points (as in Example 1.2). A set is called countable 
if its elements can be placed in a one-to-one correspondence with the positive integers. A sample space $ is said 
to be continuous if the sample points constitute a continuum (as in Example 1.3). 


C. Events: 


Since we have identified a sample space S as the set of all possible outcomes of a random experi- 
ment, we will review some set notations in the following. 
If ¢ is an element of S (or belongs to 5), then we write 


бе 5 
If S is not an element of S (or does not belong to S), then we write 


Ces 
A set A is called a subset of B, denoted by - 
AcB 


if every element of A is also an element of B. Any subset of the sample space S is called an event. A 
sample point of S is often referred to as an elementary event. Note that the sample space S is the 
subset of itself, that is, S c S. Since S is the set of all possible outcomes, it is often called the certain 
event. 


EXAMPLE 1.4 Consider the experiment of Example 1.2. Let A be the event that the number of tosses required 
until the first head appears is even. Let B be the event that the number of tosses required until the first head 
appears is odd. Let C be the event that the number of tosses required until the first head appears is less than 5. 
Express events A, B, and C. 


A-(24,6...] 
B= {1,3,5,...} 
C= (1, 2, 3, 4) 


1.3 ALGEBRA OF SETS 
A. Set Operations: 
1. Equality: 
Two sets А and B are equal, denoted A = B, if and only if A c B and B c А. 
2. Complementation: 


Suppose A c S. The complement of set A, denoted A, is the set containing all elements in S but 
not in A. 


A = {€:€e Sand ¢ ¢ A} 
3. Union: 


The union of sets A and B, denoted A ‹ В, is the set containing all elements in either A or B or 
both. 


Avo B2slt:teAorte B} 
4. Intersection: 


The intersection of sets A and B, denoted А n B, is the set containing all elements in both A 
and B. 


An Bz-(t:teAand(e B] 
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5. Null Set: 
The set containing no element is called the null set, denoted @. Note that 
0 = $ 
6. Disjoint Sets: 


Two sets A and B are called disjoint or mutually exclusive if they contain no common element, 
thatis,if A ^ B= Ø. 

The definitions of the union and intersection of two sets can be extended to any finite number of 
sets as follows: 


UASA UAU А, 
i=l 


Li 


—-(CCeA,orteA, or CeA, 


A, =A, AN А; суз NA, 
isl 


= {t:e A, and( e A; and --- Ce Aj) 


Note that these definitions can be extended to an infinite number of sets: 


E 


al 


In our definition of event, we state that every subset of S is an event, including S and the null set 
@. Then 


S = the certain event 
@ = the impossible event 


If A and B are events in S, then 


A = the event that A did not occur 
А о В = the event that either A or B or both occurred 
A ^ В = the event that both A and B occurred 


Similarly, if A,, A2,..., A, are a sequence of events in S, then 


n 
UU A; = the event that at least one of the А; occurred; 


n 


(^| Ai = the event that all of the A; occurred. 


= 1 


B. Venn Diagram: 


A graphical representation that is very useful for illustrating set operation is the Venn diagram. 
For instance, in the three Venn diagrams shown in Fig. 1-1, the shaded areas represent, respectively, 
the events А о B, A ^ B, and A. The Venn diagram in Fig. 1-2 indicates that B c A and the event 
A ^ B is shown as the shaded area. 
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(a) Shaded region: A А (b) Shaded region: A A B 





t) Shaded region: A 


Fig. 1-1 








BoA 


Shaded region: A O B 


Fig. 1-2 


C. Identities: 


By the above set definitions or reference to Fig. 1-1, we obtain the following identities: 


The union and intersection operations also satisfy the following laws: 


Commutative Laws: 
AUB=BUA 
AnB-2BnA 


Associative Laws: 
AV(BUQ=(AUBUC 
Ао (Во С) = (А л BAC 


[СНАР 1 


(1.1) 
(1.2) 
(1.3) 
(1.4) 
(1.5) 
(1.6) 
(1.7) 


(1.8) 
(1.9) 


(1.10) 
(1.11) 
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Distributive Laws: 





^ (Во С) = (А с В) о (А с С) (1.12) 
Ао (Вп С) = (А о В) л (А о С) (1.13) 
De Morgan’s Laws: 
Ао В= А г В (1.14) 
Аг В= А о В (1.15) 


These relations are verified by showing that any element that is contained in the set on the left side of 
the equality sign is also contained in the set on the right side, and vice versa. One way of showing this 
is by means of a Venn diagram (Prob. 1.13). The distributive laws can be extended as follows: 


an(UB )- Ои ав) (1.16) 
AU (A в) = Na U B) (1.17) 





(04)- (^ 4, (1.18) 





(04)- ar (1.19) 


1.4 THE NOTION AND AXIOMS OF PROBABILITY 


An assignment of real numbers to the events defined in a sample space S is known as the prob- 
ability measure. Consider a random experiment with a sample space S, and let A be a particular event 
defined in S. 


A. Relative Frequency Definition: 


Suppose that the random experiment is repeated n times. If event A occurs n(A) times, then the 
probability of event A, denoted P(A), is defined as 


P(A) = lim = (1.20) 


пә со 


where n(A)/n is called the relative frequency of event A. Note that this limit may not exist, and in 
addition, there are many situations in which the concepts of repeatability may not be valid. It is clear 
that for any event A, the relative frequency of A will have the following properties: 


1. O<n(A)/n x 1, where n(A)/n = 0 if A occurs in none of the n repeated trials and n(A)/n = 1 if A 
occurs in all of the n repeated trials. 


2. If A and В are mutually exclusive events, then 


n(A o B) = n(A) + n(B) 
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and 





B. Axiomatic Definition: 


Let S be a finite sample space and A be an event in S. Then in the axiomatic definition, the 
probability P(A) of the event A is a real number assigned to A which satisfies the following three 
axioms: 


Axiom 1: P(A4)20 (1.21) 
Axiom 2: P(S)=1 (1.22) 
Axiom 3: P(A о В) = P(A) + P(B) Ас В= 2 (1.23) 


If the sample space S is not finite, then axiom 3 must be modified as follows: 


Axiom 3’: If A,, Az, ... is an infinite sequence of mutually exclusive events in S (4; ^ А; = Ø 
for i # j), then 


(0 4) = X P(Aj) (1.24) 


These axioms satisfy our intuitive notion of probability measure obtained from the notion of relative 
frequency. 


C. Elementary Properties of Probability: 


By using the above axioms, the following useful properties of probability can be obtained: 


1. P(A) = 1 — P(A) (1.25) 
2. P(g)-20 (1.26) 
3. P(A) < P(B) if Ac B (1.27) 
4. P(A)<1 (1.28) 
5. P(A o В) = P(A) + P(B) — P(A ^ В) (1.29) 
6. ПА,, A;, ..., A, are n arbitrary events in S, then 
(Ua) -= Y P(A) — X) Aun 4) + У P(A, Aj A) 
i=1 ї= 1 i*j be jee 
= (IP P(A, 0 Aum o 0 A (1.30) 
where the sum of the second term is over all distinct pairs of events, that of the third term is over 
all distinct triples of events, and so forth. 
7. Y Ay, Az,..., A, is a finite sequence of mutually exclusive events in S (A; ^ A; = Ø fori 9 j), 


then 


of U 4) = Y P(A) (1.31) 
i=l i=] 
and a similar equality holds for any subcollection of the events. 


Note that property 4 can be easily derived from axiom 2 and property 3. Since A c S, we have 
P(A) x P(S) = 1 
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Thus, combining with axiom 1, we obtain 
0 < P(A) <1 (1.32) 
Property 5 implies that 
P(A o B) < P(A) + P(B) (1.33) 
since P(A с\ В) > 0 by axiom 1. 


15 EQUALLY LIKELY EVENTS 
A. Finite Sample Space: 
Consider a finite sample space S with n finite elements 
S = 16,6... 6 


where ¢;s are elementary events. Let Р($ ) = pi. Then 


1. О<р,<]1 i=1,2,...,n (1,34) 
2. Уре +р;›+ +p, = 1 (1.35) 
3. ПА = Ute where J is a collection of subscripts, then 

P(A) = PE = Lh (1.36) 


B. Equally Likely Events: 
When all elementary events C; (i = 1, 2, ..., n) are equally likely, that is, 
Ру\=р=с=рһ 
then from Eq. (1.35), we have 


1 
pi-- i=1,2,...,n | (1.37) 

п 
апа P(A) = — (1.38) 
where n(A) is the number of outcomes belonging to event A and п is the number of sample points 


in S. 


16 CONDITIONAL PROBABILITY 
A. Definition: 

The conditional probability of an event A given event B, denoted by P(A | B), is defined as 
P(A ^ B) 


P(A|B) - PB) P(B) > 0 (1.39) 
where P(A с\ B) is the joint probability of A and B. Similarly, 
B 
pBjp- PAP — paso (1.40) 


P(A) 
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is the conditional probability of an event B given event A. From Eqs. (1.39) and (1.40), we have 
P(A ^ B) = Р(А | В)Р(В) = P(B] A)P(A) (1.41) 
Equation (1.41) is often quite useful in computing the joint probability of events. 
B. Bayes’ Rule: 


From Eq. (1.41) we can obtain the following Bayes’ rule: 
Р(В | А)Р(А) 


P(A |В) = P(B) (1.42) 
1.7 TOTAL PROBABILITY 
The events A,, A5, ..., A, are called mutually exclusive and exhaustive if 
YAR A v Aso o А,=5 and Ас Àj — 2 i xj (1.43) 
Let B be any event in S. Then 
P(B) = ув n А) = È PBI ADP(A) (1.44) 


which is known as the total probability of event B (Prob. 1.47). Let A = A; in Eq. (1.42); then, using 
Eq. (1.44), we obtain 


P(A, | B) = LELORI) 


У Р(В|А)Р(А,) 
i=1 


(1.45) 
Note that the terms on the right-hand side are all conditioned on events A;, while the term on the left 


is conditioned on В. Equation (1.45) is sometimes referred to as Bayes’ theorem. 


18 INDEPENDENT EVENTS 


Two events A and B are said to be (statistically) independent if and only if 


P(A ^ В) = Р(А)Р(В) (1.46) 
It follows immediately that if A and B are independent, then by Eqs. (1.39) and (1.40), 
P(A|B) = P(A) and P(B| A) = P(B) (1.47) 


If two events A and B are independent, then it can be shown that A and B are also independent; that 
is (Prob. 1.53), 


P(A ^ B) = P(A)P(B) (1.48) 
Then P(A|B) = 7 = P(A) (1.49) 


Thus, if A is independent of B, then the probability of A’s occurrence is unchanged by information as 
to whether or not B has occurred. Three events A, B, C are said to be independent if and only if 
P(A ^ B ^ C) = Р(А)Р(В)Р(С) 
P(A ^ B) = P(A)P(B) 
P(A ^ C) = P(A)P(C) 
P(B ^ C) = P(B)P(C) 


(1.50) 
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We may also extend the definition of independence to more than three events. The events A,, A>,... 
A, are independent if and only if for every subset {A;,, A 
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, 


is eo Ay} (2 < k < n) Of these events, 


Р(А ^ An су, 0 Ai) = P(A P(A) --- Р(А,) (1.51) 


Finally, we define an infinite set of events to be independent if and only if every finite subset of these 
events is independent. 

To distinguish between the mutual exclusiveness (or disjointness) and independence of a collec- 
tion of events we summarize as follows: 


1. 


2. 


If (4, <= 1, 2, ..., n} is a sequence of mutually exclusive events, then 


н Ù 4) - X P(A) (1.52) 


i=1 i 


If {A,, = 1, 2, ..., п} is a sequence of independent events, then 


2l (\ A) = J] P(A) (1.53) 
i-i 


i-1 


and a similar equality holds for any subcollection of the events. 


Solved Problems 


SAMPLE SPACE AND EVENTS 


1.1. 


1.2. 


Consider a random experiment of tossing a coin three times. 


(a) 


(b) 
(a) 


(b) 


Find the sample space S, if we wish to observe the exact sequences of heads and tails 
obtained. 


Find the sample space S; if we wish to observe the number of heads in the three tosses. 
The sampling space 5, is given by 
5, = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT} 


where, for example, HTH indicates a head on the first and third throws and a tail on the second 
throw. There are eight sample points in 5,. 
The sampling space 5, is given by 

S, = {0, 1, 2, 3} 


where, for example, the outcome 2 indicates that two heads were obtained in the three tosses. The 
sample space S, contains four sample points. 


Consider an experiment of drawing two cards at random from a bag containing four cards 
marked with the integers 1 through 4. 


(a) 


(b) 
(a) 


Find the sample space S, of the experiment if the first card is replaced before the second is 
drawn. 
Find the sample space S, of the experiment if the first card is not replaced. 
The sample space S, contains 16 ordered pairs (i, j, 1 «i4, 1 €j €4, where the first number 
indicates the first number drawn. Thus, 
(14,1) (1,2 (13) (I 4) 
_ JQ 4 (2,2) (23) (2,4) 
‘13,0 62 (3,3) (3,4) 
(4,1) (42) (4,3) (4,4) 
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1.3. 


1.4. 


1.5. 


(b) 
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The sample space 5, contains 12 ordered pairs (i, /), і Æj, | &i € 4, 1 € j € 4, where the first number 
indicates the first number drawn. Thus, 


(1,2) (L3) (1,4) 
(2, 1) (2,3) (2.4) 
(3,1) (3, 2) (3, 4) 
(4, 1) (4, 2) (4, 3) 


$,- 


An experiment consists of rolling a die until a 6 is obtained. 


(b) 


Find the sample space S, if we are interested in all possibilities. 
Find the sample space S, if we are interested in the number of throws needed to get a 6. 


The sample space S, would be 
5, = {6, 
16, 26, 36, 46, 56, 
116, 126, 136, 146, 156, ...} 
where the first line indicates that а 6 is obtained in one throw, the second line indicates that а 6 is 
obtained in two throws, and so forth. 
In this case, the sample space S, is 


5; ={йї>1} = (,2,3,...) 


where i is an integer representing the number of throws needed to get a 6. 


Find the sample space for the experiment consisting of measurement of the voltage output v from 
a transducer, the maximum and minimum of which are +5 and — 5 volts, respectively. 


A suitable sample space for this experiment would be 


S={v:-5S<v<5} 


An experiment consists of tossing two dice. 


(b) 


(с) 


(d) 


Find the sample space 5. 
Find the event A that the sum of the dots on the dice equals 7. 
Find the event B that the sum of the dots on the dice is greater than 10. 
Find the event C that the sum of the dots on the dice is greater than 12. 
For this experiment, the sample space S consists of 36 points (Fig. 1-3): 

S= {(i, j): i,j = 1, 2, 3, 4, 5, 6} 


where i represents the number of dots appearing on one die and j represents the number of dots 
appearing on the other die. 


The event A consists of 6 points (see Fig. 1-3): 
A = {(1, 6), (2, 5), (3, 4), (4, 3), (5, 2), (6, 1)} 
The event B consists of 3 points (see Fig. 1-3): 
В = {(5, 6), (6, 5), (6, 6)} 


The event C is an impossible event, that is, C = @. 
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1.6. 


1.7. 


1.8. 
















5 


LUS 0,6) (3,60) (4.6) (6.6 (685) 
bo c9-e5-059 «459 8.16.5) 
(1.4) (249-3. 4.44.4) (5,4) (6,4) 
[ 1.3) 23) (G93-45.8--69 (63) 
Г 0.2 (22) (32 (4, 2)--. (6.2) 6.2) 
(л) (Q9 ($0 (41) HIP (6.1), 





LLL } | 1! 1. 





Ей. 1-3 


An automobile dealer offers vehicles with the following options: 
(a) With or without automatic transmission 


(b) With or without air-conditioning 
(с) With one of two choices of a stereo system 
(d) With one of three exterior colors 


If the sample space consists of the set of all possible vehicle types, what is the number of out- 
comes in the sample space? 


The tree diagram for the different types of vehicles is shown in Fig. 1-4. From Fig. 1-4 we see that the 
number of sample points in S is 2 x 2x 2x 3 = 24. 


Transmission Automatic Manual 


Air-conditioning 


Stereo 


Color 





Fig. 1-4 


State every possible event in the sample space S = (a, b, c, d}. 


There аге 24 = 16 possible events in S. They are Ø; {a}, (b), (c), {d}; (a, b), (a, c), (a, d), (b, c}, 
(b, 4), (e, d); (a, b, c}, (a, b, d), (a, с, d), (b, c, d); S = (а, b, c, d). 


How many events are there in a sample space 5 with n elementary events? 


Let S = (5,, 55, ..., 5,}. Let Q be the family of al! subsets of S. (Q is sometimes referred to as the power 
set of S.) Let 5; be the set consisting of two statements, that is, 


S; = (Yes, the s, is in; No, the 5, is not in) 
Then Q can be represented as the Cartesian product 


QzS,xS$,x-x8, 
= {(51, 5:,..., 5): 5; € S; for i = 1, 2,..., n} 
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Since each subset of S can be uniquely characterized by an element in the above Cartesian product, we 
obtain the number of elements in Q by 


n(Q) = n(S,)n(S2) +++ n(S,) = 2" 


where n(S;) 2 number of elements in S; = 2. 
An alternative way of finding n(Q) is by the following summation: 


n fn n n! 
n= > () = mai 


The proof that the last sum is equal to 2” is not easy. 


ALGEBRA OF SETS 
1.9. | Consider the experiment of Example 1.2. We define the events 
A = {k: k is odd} 
B=({k:4<k<7} 
C={k:1<k< 10) 
where k is the number of tosses required until the first H (head) appears. Determine the events A, 
В, С, Ао В, Во С, Ап В, Ап С, BAC andAnB. 


А = {k: k is even} = (2, 4, 6,...) 
B= (k:k=1,2,30rk>8} 


C = {kik 2 11} 

Au B={k:k is odd or k = 4, 6} 
Ви С=С 

A ^ B={5,7} 

A ^ € - (1,3, 5, 7, 9) 
Во С = В 

An В = {4, 6} 


1.10. Тһе sample space of an experiment is the real line expressed as 
5 = {vi -oo <v< о) 


(a) Consider the events 


a na 
N- 
lt || 
== 
n- © 
ІА ТА 
© S 
AAA 
Plu Np 


Determine the events 


(b) Consider the events 


o 
Iu 
~ 
= 
=- 


IA. 1А 


< = 
Pe юре 
Ll 


с 
M 
© |l 
— 
е 


& 
Il 
c 
c 
lA 
NI) 
——2 
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1.11. 


Determine the events 
Ü В, апа А В; 
i=1 
(а) It is clear that 
UA = (60v «1) 
i=L 


Noting that the A,’s are mutually exclusive, we have 


(b) Noting that В, > B, э-э B,>---, we have 


(B, = B, = i: v <4) and QB: = {v: v < 0) 


i=l i=} 


Consider the switching networks shown in Fig. 1-5. Let A,, A2, and A, denote the events that 
the switches s,, s2, and s, are closed, respectively. Let A,, denote the event that there is a closed 
path between terminals a and b. Express A,, in terms of A,, A,, and A, for each of the networks 
shown. 


s 55 5 5 ? 
а о ee tn b а fe b 
(а) 





5i 5 
—— . 
о, 
(а) 


Fig. 1-5 
(a) From Fig. 1-5(a), we see that there is a closed path between a and b only if all switches s,, s, and s, 
are closed. Thus, 
Ay = А, с А, с Аз 
(b) From Fig. 1-5(b), we see that there is a closed path between a and b if at least one switch is closed. 
Thus, 
Ag = Аро А, м А, 
(c) From Fig. 1-5(c), we see that there is a closed path between a and b if s, and either s; or s, are closed. 
Thus, 
Ay = A, A (A5 о Ах) 
Using the distributive law (1.12), we have 
Ay = (A, ^ Ag) U (A, ^ A3) 
which indicates that there is a closed path between a and b if s, and s; or s, and s; are closed. 


(d) From Fig. 1-5(d), we see that there is a closed path between a and b if either s, and s; are closed ors, 
is closed. Thus 


Аљ = (44 ^ А) о А, 
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1.13, 


1.14. 


1.15. 
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Verify the distributive law (1.12). 


Let se [А ^ (B o C)]. Then s e А and s e (B o C). This means either that se A and s € B or that 
seAandseC;thatissc(A ^ В) ors e(A A С). Therefore, 


Ао (Во Сс ЦА о B о (А с\ €] 


Next, let se (A ^ B) (А ^ C)]. Then se А and se Bor se A and se C. Thus s e A and (s e B or 
s € С). Thus, 


LAA В) о (Ас CI CAN(BUC) 
Thus, by the definition of equality, we have 
Ао (Во С) = (А п В) о (А с С) 


Using a Venn diagram, repeat Prob. 1.12. 


Figure 1-6 shows the sequence of relevant Venn diagrams. Comparing Fig. 1-6(6) and 1-6(e), we con- 
clude that 


Ал (Во С) = (А л В) о (А С) 





























(c) Shaded region: A A В Uh Shaded region A C 








(e) Shaded region: (A Су B) U (А NC) 


Fig. 1-6 


Let A and В be arbitrary events. Show that А c B if and only if А ^ B= А. 


"If" part: We show that if A ^ B = A, then Ac B. Let se A. Then s є (А ^ B), since A = А n B. 
Then by the definition of intersection, s € B. Therefore, A c B. 

"Only if" part: We show that if A c B, then A ^ В = A. Note that from the definition of the intersec- 
tion, (A ^ B) c A. Suppose s e A. If A с B, then s є B. Sos e A and s є В; that is, s e (A A В). Therefore, 
it follows that A c (A ^ B). Hence, A = A A B. This completes the proof. 


Let A be an arbitrary event in $ and let @ be the null event. Show that 
(а) AUG=A (1.54) 
(b) A^ б= @ (1.55) 
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1.16. 


1.17. 


(а) 40 8 -ísseAorse Ø} 
But, by definition, there аге no s є @. Thus, 


Ао @ = {5:564} = А 


b) AN 6 = {5: зє Aandse Ø} 
But, since there are no s є @, there cannot be an s such that s є A and s є Ø. Thus, 


Ang-g 


Note that Eq. (1.55) shows that @ is mutually exclusive with every other event and including with 
itself. 


Show that the null (or empty) set @ is a subset of every set A. 
From the definition of intersection, it follows that 
(A^ BcA and (4^ В) < В (1.56) 


for any pair of events, whether they are mutually exclusive or not. If A апа В are mutually exclusive events, 
that is, 4 ^ B = Ø, then by Eq. (1.56) we obtain 


б=А and cB (1.57) 
Therefore, for any event A, 
f cA (1.58) 


that is, is a subset of every set A. 
Verify Eqs. (1.18) and (1.19). 


(a) Suppose first that s є (0 4) then s é (0 4) 
i-1 iz] 


That is, if s is not contained in any of the events 4;, i = 1, 2, ..., n, then s is contained in А, for all 
ї = 1, 2,..., п. Thus 


se (\4, 


Next, we assume that 


Then s is contained in A; for all i = 1, 2, ..., n, which means that s is not contained in А, for any i = 1, 
2, ..., n, implying that 





Thus, se (0 4) 


This proves Eq. (1.18). 
(b) Using Eqs. (1.18) and (1.3), we have 








which is Eq. (1.19). 
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THE NOTION AND AXIOMS OF PROBABILITY 
1.18 Using the axioms of probability, prove Eq. (1.25). 


1.19. 


1.20. 


1.21. 


We have 
S=AUA and Ас А=(@ 
Thus, by axioms 2 and 3, it follows that 
P(5) == 1 = P(A) + P(A) 


from which we obtain 


P(A) = | — P(A) 
Verify Eq. (1.26). 
From Eq. (1.25), we have 
P(A) = 1 — P(A) 
Let A = Ø. Then, by Eq. (1.2), А = @ = S, and by axiom 2 we obtain 
Р(@ у= 1 Р(5) =1-1= 0 


Verify Eq. (1.27). 


Let A c B. Then from the Venn diagram shown in Fig. 1-7, we sce that 


B-2Ao(An B) and Aní(AnB-2dg 


Hence, from axiom 3, 
P(B) = PLA) + P(A ^ B) 
However, by axiom 1, P(A ^ B) > 0, Thus, we conclude that 
P(A) € РВ) if ACB 





Shaded region: A A B 


Fig. 1-7 


Verify Eq. (1.29). 


[CHAP | 


(1.59) 


From the Venn diagram of Fig. 1-8, each of the sets A U B and B can be represented, respectively, as a 


union of mutually exclusive sets as follows: 


AUB=Au (4 п В) and В = (А с В) (An B) 


Thus, by axiom 3, 
P(A o B) = P(A) + P(A ^ В) 
and P(B) = P(A ^ B) + P(A n B) 
From Eq. (1.61), we have 
РА ^ В) = P(B) — P(A ^ B) 
Substituting Eq. (1.62) into Eq. (1.60), we oblain 
P(A ‹ В) = P(A) + P(B) — P(A ^ B) 


(1.60) 
(1.61) 


(1.62) 
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5 5 
QR) р: 
Shaded region: A ^ B Shaded region: A N B 
Fig. 1-8 


1.22. Let P(A) = 0.9 and P(B) = 0.8. Show that P(A ^ B) > 0.7. 
From Eq. (1.29), we have 
P(A ^ B) = P(A) + P(B) — P(A o B) 
By Eq. (1.32, 0 < P(A ‹ B) < 1. Hence 
P(A ^ В) > P(A) + P(B) — 1 (1.63) 
Substituting the given values of P(A) and P(B) in Eq. (1.63), we get 
P(A ^ B) > 0.9 + 0.8 – 1 = 07 


Equation (1.63) is known as Bonferroni's inequality. 


123. Show that 


P(A) = P(A ^» B) + P(A ^ B) (1.64) 
From the Venn diagram of Fig. 1-9, we see that 
А = (А г B) v (A n B) and (An B (An В) = g (1.65) 


Thus, by axiom 3, we have 


P(A) = P(A ^ B) + P(A ^ B) 





1.24. Given that P(A) = 0.9, P(B) = 0.8, and P(A ^ В) = 0.75, find (a) P(A o B); (b) P(A ^ B); and (с) 
P(A ^ B). 
(а) By Eq. (1.29), we have 
P(A о B) = P(A) + P(B) — P(A ^ В) = 0.9 + 0.8 — 0.75 = 095 
(b) By Eq. (1.64) (Prob. 1.23), we have 
P(A г В) = P(A) - P(A ^ В) = 0.9 — 0.75 = 0.15 
(c) By De Morgan’s law, Eq. (1.14), and Eq. (1.25) and using the result from part (a), we get 
P(A ^ В) = P(A о В) = 1 — P(A о В) = 1 – 0.95 = 005 
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1.25. 


1.26. 


1.27. 


PROBABILITY 


For any threc events A,, A,, and A,, show that 
P(A, U A; о Ay) = P(A,) + Р(А,) + P(A3) — P(A, A A3) 
— P(A, ^ Ay) — Р(А с\ Аз) + P(A, с\ А, A Аз) 


Let B = A, о Aj. By Eq. (1.29), we have 
P(A, о В) = P(A,) + P(B) — P(A, ^ B) 
Using distributive law (1.12), we have 
А, A B= A, ^ (A, U А3) = (А4, ^ AQ) v (А, ^ Аз) 
Applying Eq. (1.29) to the above event, we obtain 
P(A, ^ B) = P(A, ^ Aj) + P(A, ^ Ag) — РКА, ^ A) ^ (А, ^ Ag)] 
= P(A, n Aj) + P(A, n Аз) – Р(А ^ Az ^ Аз) 

Applying Eq. (1.29) to the set B = A; U Aj, we have 

P(B) = P(A, о Aj) = P(A,)  P(A4) — P(A, n Ay) 
Substituting Eqs. (1.69) and (1.68) into Eq. (1.67), we get 


P(A, о A; о Аз) = P(A,) + P(A;) + P(A,) — P(A, ^ Az) — Р(А A А) 
— P(A, ^ Аз) + P(A, ^ A; с Ay) 


Prove that 
of U 4) < Y P(A) 
i=l i=1 
which is known as Boole's inequality. 
We will prove Eq. (1.70) by induction. Suppose Eq. (1.70) is true for n = k. 


(sje fo 


De Коа) (u^ e] 
(о 


<P| а) + P(A.) [by Eq. (/.33)] 
k+l 


x Y P(A) + FA, 1) = Y P(A) 


i=1 i=l 


[CHAP 1 


(1.66) 


(1.67) 


(1.68) 


(1.69) 


(1.70) 


Thus Eq. (1.70) is also true for n = k + 1. By Eq. (1.33), Eq. (1.70) is true for n = 2. Thus, Eq. (1.70) is true 


for n z 2. 


Verify Eq. (1.31). 
Again we prove it by induction. Suppose Eq. (1.31) is true for n = k. 


0а) - Y PA) 


i= i=} 


ne CG] 


Using the distributive law (1.16), we have 


(U4)o 4. = (Apa А +1) = Ш = 23 
i i21 


i=l 
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1.28. 


since А, ^ 4; = Ø fori # j. Thus, by axiom 3, we have 


к+1 k k+l 
(Ua) = (UA) + P(A.) = > P(A) 
icd 1 


which indicates that Eq. (1.31) is also true for n = k + 1. By axiom 3, Eq. (1.31) is true for n = 2. Thus, it is 
true for n > 2. 


A sequence of events ( 4,, n > 1} is said to be an increasing sequence if [Fig. 1-10(a)] 








A, CA, O°: CA, CAL, Coe (1.71a) 
whereas it is said to be a decreasing sequence if [Fig. 1-10(b)] 
A, DAD DA, D А, Do (1.71b) 








(a) (Р) 
Fig. 1-10 
If {A,, > 1} is an increasing sequence of events, we define a new event A,, by 
A, = limA, = |] A; (1.72) 
nox. i=l 
Similarly, if {A,,n > 1} is a decreasing sequence of events, we define а new event A,, by 
A, = limA, = ( A; (1.73) 
noa i=] 
Show that if (4,, n 2 1} is cither an increasing or a decreasing sequence of events, then 
lim P(A,) = P(A,,) (1.74) 


nog 
which is known as the continuity theorem of probability. 


If 1A,, n > 1} is an increasing sequence of events, then by definition 


Now, we define the events B,, n > 1, by 


В, =A, 
B, =A, 0A, 
В. = А, A А-1 


Thus, B, consists of those elements in A, that are not in any of the earlier A,, k < n. From the Venn 
diagram shown in Fig. 1-11, it is seen that B, are mutually exclusive events such that 


n а. x 
(18, = (ЈА, forall n > 1, and (JB, = (JA, = A, 


i=l i=l i-d i=l 
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Vy 


ee 








Thus, using axiom 3’, we have 


= lim D" = њр ÜB, 


no i=l no 


lim 0 U 4) = lim P(A,) (1.75) 


no i=l 


Next, if {A,, n > 1} is a decreasing sequence, then {A,, n > 1} is an increasing sequence. Hence, by Eq. 
(1.75), we have 


e( D &) = lim P(&,) 


no 


From Eq. (1.19), 








Thus, al( f 4)] = lim P(A,) (1.76) 


л 0 


Using Eq. (1.25), Eq. (1.76) reduces to 


1- қа) = lim[1 — P(A,)) = 1 — lim P(A,) 


no no 


Thus, й (\A )-^ A,) = lim P(A,) (1.77) 


noc 


Combining Eqs. (1.75) and (1.77), we obtain Eq. (1.74). 


EQUALLY LIKELY EVENTS 


1.29. Consider a telegraph source generating two symbols, dots and dashes. We observed that the dots 


were twice as likely to occur as the dashes. Find the probabilities of the dot's occurring and the 
dash's occurring. 
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1.30. 


1.31. 


1.32. 


From the observation, we have 
P(dot) = 2P(dash) 
Then, by Eq. (1.35), 
P(dot) + P(dash) = 3P(dash) = 1 
Thus, P(dash) = 4 and P(dot) = 4 


The sample space $ of a random experiment is given by 
S = (a, b, c, d) 

with probabilities P(a) = 0.2, P(b) = 0.3, P(c) = 0.4, and P(d) = 0.1. Let A denote the event (a, b], 
and B the event (b, с, d). Determine the following probabilities: (a) P(A); (b) P(B); (c) P(A); (d) 
P(A o B); and (е) P(A г B). 

Using Eq. (1.36), we obtain 
(a) P(A) = Pla) + P(b) = 0.2 + 03 = 0.5 
(b) P(B)- P(b) + P(c) + P(d) = 0.3 + 04 + 0.1 = 0.8 
(c) A= (c d); P(A) = P(c) + P(d) = 04 + 0.1 = 0.5 
(d A Uu B= (а, b, c, d} = S; P(A о В) = Р(5) = 1 
(e) A B= {b}; P(A ^ B) = P(b) = 0.3 


An experiment consists of observing the sum of the dice when two fair dice are thrown (Prob. 
1.5). Find (a) the probability that the sum is 7 and (b) the probability that the sum is greater than 
10. 


(а) Let ¢,, denote the elementary event (sampling point) consisting of the following outcome: ё,, = (i, j), 
where i represents the number appearing on one die and j represents the number appearing on the 
other die. Since the dice are fair, all the outcomes are equally likely. So P(¢;;) = 5. Let A denote the 
event that the sum is 7. Since the events ¢;, are mutually exclusive and from Fig. 1-3 (Prob. 1.5), we 
have 

P(A) = Р(б\є Y C25 о Caa Y баз U O52 Y 661) 
= Р(& |) + Plbrs) + Pls4) + P(Eas) + Ps2) + Pis.) 
= 6(45) = i 
(b) Let B denote the event that the sum is greater than 10. Then from Fig. 1-3, we obtain 
P(B) = P(Gsg о 665 U 666) = Р(656) + P(Gos) + Plees) 
= 335) = d 


There are п persons in a room. 

(a) What is the probability that at least two persons have the same birthday? 
(b) Calculate this probability for n — 50. 

(c) How large need n be for this probability to be greater than 0.5? 


(a) As each person can have his or her birthday on any one of 365 days (ignoring the possibility of 
February 29), there are a total of (365)" possible outcomes. Let A be the event that no two persons 
have the same birthday. Then the number of outcomes belonging to A is 


n(A) = (365X364) «+» (365 -- n + 1) 
Assuming that each outcome is equally likely, then by Eq. (1.38), 


_ n(A) _ (365)(364) --- (365 — n + 1) 


n(S) _ (365)" (1.78) 


P(A) 
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Let B be the event that at least two persons have the same birthday. Then B = A and by Eq. (1.25), 
P(B) = 1 — P(A). 


(b) Substituting п = 50 in Eq. (1.78), we have 
Р(А)= 003 and Р(В) & 1 ~ 0.03 = 0.97 
(c) From Eq. (1.78), when п = 23, we have 
Р(А) = 0.493 and Р(В)=1— P(A) x 0.507 


That is, if there are 23 persons in a room, the probability that at least two of them have the same 
birthday exceeds 0.5. 


1.33. А committee of 5 persons is to be selected randomly from a group of 5 men and 10 women. 


(a) Find the probability that the committee consists of 2 men and 3 women. 
(b) Find the probability that the committee consists of all women. 


15 
ns) (0) 


It is assumed that "random selection" means that each of the outcomes is equally likely. Let A be the 
event that the committee consists of 2 men and 3 women. Then the number of outcomes belonging to 


A is given by 
"7A з 
(5) 
А) \2/\ 3 400 


ME (Бу M 
5 


(a) The number of total outcomes is given by 


Thus, by Eq. (1.38), 


(b) Let B be the event that the committee consists of all women. Then the number of outcomes belonging 


to B is 
5/10 
ne- QS) 
(KS) 
_щВ) NOAS/ 36 
~ n(S) (3) ~ 429 
5 


1.34. Consider the switching network shown in Fig. 1-12. It is equally likely that a switch will or 
will not work. Find the probability that a closed path will exist between terminals a and b. 


Thus, by Eq. (1.38), 





Fig. 1-12 
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Consider a sample space S of which a typical outcome is (1, 0, 0, 1), indicating that switches 1 and 4 are 
closed and switches 2 and 3 are open. The sample space contains 2* = 16 points, and by assumption, they 
are equally likely (Fig. 1-13). 

Let A;, i= 1, 2, 3, 4 be the event that the switch s; is closed. Let A be the event that there exists a 
closed path between a and b. Then 


А = А, о (A, n Аз) v (А N A4) 
Applying Eq. (1.30), we have 
P(A) = PLA, v (А, ^ Аз) \ (4; ^ A4)] 

= P(Ai) + P(A; ^ Аз) + Р(А, ^ Aa) 
— P[A, ^ (A, ^ A] - PLA, ^ (432 A A3)] - PUA: 6 A с (45 A 44)] 
+ P[A, ^ (А, ^ Аз) A (А, A Aall 

= P(A,) + P(A, n Аз) + P(A, с Aa) 
— P(A, A А5 A A) — P(A, A А, A AQ) - P(A A Ау A Аш) 
+ P(A, ^ А, п Ay A Aa) 


Now, for example, the event A, с\ A, contains all elementary events with a 1 in the second and third 
places. Thus, from Fig. 1-13, we see that 


n(A,) = 8 n(A; с Аз) 24 ЩА ^ А) = 4 
n(A, п A; A) = 2 n(A, ^ 45.0 Ay) = 2 
n(A, ^ Аз с\ Ay) = 2 nA, 40^ 44,0 А) = 1 


Thus, 
PA) = dedo we =H x 0688 







Ém.---------o 


Fig. 1-13 


1.35. Consider the experiment of tossing a fair coin repeatedly and counting the number of tosses 
required until the first head appears. 


(a) Find the sample space of the experiment. 
(b) Find the probability that the first head appcars on the kth toss. 
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(c) Verify that P(S) = 1. 
(a) The sample space of this experiment is 
5 = {e ey, ез,...) = e: k = 1, 2, 3, ...] 


where e, is the elementary event that the first head appears on the kth toss. 


(b) Since a fair coin is tossed, we assume that a head and a tail are equally likely to appear. Then P(H) = 
P(T) = 4. Let 


P(e,) = p, k21,2,3,... 


Since there are 2* equally likely ways of tossing a fair coin k times, only one of which consists of (k — 1) 
tails following a head we observe that 





1 
Ра) = =» К=1,2,3,... (1.79) 
(c) Using the power series summation formula, we have 
w о у о 1 k i 
Р) = У Ре) = У з= У (5) =т= (1.80) 
k=1 k=1 2 k=t 2 | — T 


Consider the experiment of Prob. 1.35. 


(a) Find the probability that the first head appears on an even-numbered toss. 
(b) Find the probability that the first head appears on an odd-numbered toss. 


(a) Let A be the event “the first head appears on an even-numbered toss." Then, by Eq. (1.36) and using 
Eq. (1.79) of Prob. 1.35, we have 





© ao 1 o6 lw i 1 
Мар Ўра Y зве Ў (i) =- 373 
m=i m=1 


= т= 1 


(b) Let В be the event “the first head appears on an odd-numbered toss.” Then it is obvious that В = A. 
Then, by Eq. (/.25), we get 


P(B) = P(A) = 1 – P(A)=1—4=3 


As a check, notice that 


= 2 1 12 f1\" 1f | 2 
P(B) = + + E = —— = – - = — = – 
(В) = р, + ру + Ps + У ратат У X75 У (3) (= ;) 3 





CONDITIONAL PROBABILITY 
1.37. Show that P(A | В) defined by Eq. (1.39) satisfies the three axions of a probability, that is, 


(а) P(A|B)zO 

(Б) P(S|B)=1 

(c) P(A, o А,1В) = P(A,|B) + P(A;|] B i A, ^ A; = Ø 
(а) From definition (1.39), 


P(A ^ B) 


P(A|B) = PB 


P(B) > 0 


By axiom 1, P(A ^ B) = 0. Thus, 
P(A1B) 20 
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(b) By Eq. (1.5), 5 ^ B = B. Then 


PB P(B) 
(c) By definition (1.39), 


PA, o alp = = T 
Now by Eqs. (1.8) and (1.11), we have 
(A, о 4) ^ B-(A, ^ B o (А, n B) 
and A, ^ A, = Ø implies that (А, ^ B) A (А, ^ B) = Ø. Thus, by axiom 3 we get 


P(A, ^ B) + Р(Аз СВ) P(4, ^ В). P(A, ^ B) 
P(B) |. P(B) P(B) 
= P(A,|B)+ P(A,|B) if A, A; 07 


P(A, о A,|B)= 





1.38. Find P(A |B) if (a) A ^ B = Ø, (b) A c B, and (c) B c A. 
(а) ПА ^ B = Qj, then P(A ^ В) = P(j) = 0. Thus, 


P(A|B) = P(A ^ В) P(g) 








PB  Р(В) — 
(b) КА = B,then 4 ^ В = Aand 
pA в) = A0 BD _ РИ) 
P(B) P(B) 
(c) If B c A, then A ^ B= B and 
pajp- A80 B FD, 


P(B) PB) 


1.39. Show that if P(A | B) > P(A), then P(B| A) > P(B). 


It ав) = TOS A), then PIA гу B) > POP) Thus, 
рва) = PAO B. PAPO) _ рв or АВ|д)> P(B) 





———— > 
P(A) P(A) 


1.40. Consider the experiment of throwing the two fair dice of Prob. 1.31 behind you; you are then 
informed that the sum is not greater than 3. 


(a) Find the probability of the event that two faces are the same without the information given. 
(b) Find the probability of the same event with the information given. 


(a) Let A be the event that two faces are the same. Then from Fig. 1-3 (Prob. 1.5) and by Eq. (1.38), we 
have 


and 
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(b) Let B be the event that the sum is not greater than 3. Again from Fig. 1-3, we see that 
В = {(i, j): i + js 3} = {(1, 1), (1, 2), (2, D} 
and 
nB) 3 1 
P(B) = — = = — = — 
(B) n(S) 36 12 
Now A ^ B is the event that two faces are the same and also that their sum is not greater than 3. 
Thus, 


AaB) 1 
PA o pj AD. 


Then by definition (1.39), we obtain 


Note that the probability of the event that two faces are the same doubled from % to 4 with the 
information given. 


Alternative Solution: 


There are 3 elements in B, and 1 of them belongs to A. Thus, the probability of the same event 
with the information given is +. 


Two manufacturing plants produce similar parts. Plant 1 produces 1,000 parts, 100 of which are 
defective. Plant 2 produces 2,000 parts, 150 of which are defective. A part is selected at random 
and found to be defective. What is the probability that it came from plant 1? 


Let B be the event that "the part selected is defective," and let A be the event that "the part selected 
came from plant 1." Then A ^ B is the event that the item selected is defective and came from plant 1. 
Since a part is selected at random, we assume equally likely events, and using Eq. (1.38), we have 


P(A ^ B) = 190, = 1 
Similarly, since there are 3000 parts and 250 of them are defective, we have 
P) = #6 = 


By Eq. (1.39), the probability that the part came from plant | is 


Alternative Solution: 


There are 250 defective parts, and 100 of these are from plant 1. Thus, the probability that the 


defective part came from plant 1 is 190 = 0.4. 


A lot of 100 semiconductor chips contains 20 that are defective. Two chips are selected at 
random, without replacement, from the lot. 


(a) What is the probability that the first one selected is defective? 


(b) What is the probability that the second one selected is defective given that the first one was 
defective? 
(c) What is the probability that both are defective? 
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(a) Let A denote the event that the first one selected is defective. Then, by Eq. (1.38), 
P(A) = 4$; = 02 


(b) Let B denote the event that the second one selected is defective. After the first one selected is defective, 
there are 99 chips left in the lot with 19 chips that are defective. Thus, the probability that the second 
one selected is defective given that the first one was defective is 


P(B| A) = 18 = 0.192 
(c) By Eq. (1.41), the probability that both are defective is 
P(A ^ B) = P(B| A)P(A) = (43)(0.2) = 0.0384 


À number is selected at random from (1, 2, ..., 100). Given that the number selected is divisible 
by 2, find the probability that it is divisible by 3 or 5. 


Let A, = event that the number is divisible by 2 
A, — event that the number is divisible by 3 
A, = event that the number is divisible by 5 
Then the desired probability is 


Р[(А, о 45) ^ Ai] 





P(A A5|A;)— Eq. (1.39 
(А, о A5| 4j) P(A;) [Eq. (1.39)] 
PU(A; ^ А) v (As ^ A3] 
= ——_—————— [Eq. 0.12)] 
P(A) 
P(A} ^ А) + P(A; ^ Aj) - Р(Аз ^ A; A A4) 
= —————— Eq. (1.29 
P(A.) LEq. (1.29)] 
Now Аз ^ A, = event that the number is divisible by 6 
As N A, = event that the number is divisible by 10 
A, As ^ А, = event that the number is divisible by 30 
and P(A, ^ А») = d P(A; ^ Aj) = is P(A, ^ As ^ Aj) = тё 
Mc 105 — тёр 2 
Thus, P(A; o Ag| Ay) = 7992-2190 — 108 _ = _ 046 
188 50 


Let A,, A5, ..., A, be events in a sample space S. Show that 
P(A, ^ A50 A A) = P(A)P(A;] A)P(A,]A, ^ A) -: PAIA OA m 5: А, )) 
(1.81) 


We prove Eq. (1.81) by induction. Suppose Eq. (1.81) is true for n — k: 
P(A, ^ А, су, A A) = P(A,)P(A|A,)P(A3] 44.0 A) c РАДА OANA су AV) 
Multiplying both sides by Р(А,. 1А, O А, ^ °°: ^ A) we have 
P(A, VA; 0-0 AJP(A, lA 0 А, о А) = P(A, с\ А, пе Ару) 
and P(A, A; ncc A Au) = P(A PIA A DPI] А, су A o PlAga A OAO гу Ау) 


Thus, Eq. (1.81) is also true for n = k + 1. By Eq. (1.41), Eq. (1.81) is true for n = 2. Thus Eq. (1.81) is true 
for n z 2. 


Two cards are drawn at random from a deck. Find the probability that both are aces. 


Let A be the event that the first card is an ace, and B be the event that the second card is an ace. The 
desired probability is P(B ^ A). Since a card is drawn at random, P(A) = $. Now if the first card is an асе, 
then there will be 3 aces left in the deck of 51 cards. Thus P(B| A) = &. By Eq. (1.41), 


P(B ^ A) = P(B| A)P(A) = (35 $2) = тт 
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Check: 


By counting technique, we have 


Ө 
2 (43) 1 


me ODS GS nen m 
2 


1.46. There are two identical decks of cards, each possessing a distinct symbol so that the cards from 
each deck can be identified. One deck of cards is laid out in a fixed order, and the other deck is 
shuffled and the cards laid out one by one on top of the fixed deck. Whenever two cards with the 
same symbol occur in the same position, we say that a match has occurred. Let the number of 
cards in the deck be 10. Find the probability of getting a match at the first four positions. 


Let A;, i = 1, 2, 3, 4, be the events that a match occurs at the ith position. The required probability is 
P(A; ^ А, A Аз A Ад) 
By Eq. (1.81), 
P(A, ^ А, ^ A, ^ Ад) = P(A,)P(A,|A,)P(A3] A, A A2)P(Ag| 41 ^ А, ^ Ag) 


There are 10 cards that can go into position 1, only one of which matches. Thus, P(A,) = 75. P(A,| А,) is 
the conditional probability of a match at position 2 given a match at position 1. Now there are 9 cards left 
to go into position 2, only one of which matches. Thus, P(A,|A,) = 5. In a similar fashion, we obtain 
Р(А |А, ^ А,) = $ and P(A4|A, ^ А, ^ Ау) = 3. Thus, 


P(A, ^ А; ^ Ay A As) = (16050809) = z075 


TOTAL PROBABILITY 
1.47. Verify Eq. (1.44). 
Since B ^ S = B [and using Eq. (1.43)], we have 
В= Во 5 = В л (A, о А, U + vA, 
= (Вл Ау) У (Вп А) о U(BOA,) (1.82) 
Now the events B ^ А,, i = 1,2, ..., п, are mutually exclusive, as seen from the Venn diagram of Fig. 1-14. 


Then by axiom 3 of probability and Eq. (1.41), we obtain 


P(B) = P(B ^ 5) = Y, P(B ^ A) = Y, BI A)PLA) 


i 1 


BOA, B 
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1.48. Show that for any events A and B in S, 
P(B) = P(B| A)P(A) + P(B| A)P(A) (1.83) 
From Eq. (1.64) (Prob. 1.23), we have 
P(B) = P(B ^n A) + Р(В n A) 
Using Eq. (1.39), we obtain 
P(B) = P(B| AJP(A) + P(B| A)P(A) 


Note that Eq. (1.83) is the special case of Eq. (1.44). 


1.49. Suppose that a laboratory test to detect a certain disease has the following statistics. Let 


A =event that the tested person has the disease 
В = event that the test result is positive 


It is known that 
P(B| A) = 0.99 and P(B| A) = 0.005 


and 0.1 percent of the population actually has the disease. What is the probability that a person 
has the disease given that the test result is positive? 


From the given statistics, we have 
P(A) = 0.001 then P(A) = 0.999 
The desired probability is P(A | B). Thus, using Eqs. (1.42) and (1.83), we obtain 


_ P(BIAP(A) ——— 
P(A|B) = РВ | A)P(A) + P(B| A)P(A) 
(0.99\0.001) 


= (0.990.001) + (00050999) - 9165 





Note that in only 16.5 percent of the cases where the tests are positive will the person actually have the 
disease even though the test is 99 percent effective in detecting the disease when it is, in fact, present. 


1.50. А company producing electric relays has three manufacturing plants producing 50, 30, and 20 
percent, respectively, of its product. Suppose that the probabilities that a relay manufactured by 
these plants is defective are 0.02, 0.05, and 0.01, respectively. 


(а) Ifa relay is selected at random from the output of the company, what is the probability that 
it is defective? 

(b) Ifa relay selected at random is found to be defective, what is the probability that it was 
manufactured by plant 2? 


(a) Let B be the event that the relay is defective, and let А, be the event that the relay is manufactured by 
plant i (i = 1, 2, 3). The desired probability is P(B). Using Eq. (1.44), we have 


3 
P(B) = Y, P(B| A)P(A) 
{= 1 


= (0.02X0.5) + (0.05)(0.3) + (0.01)(0.2) = 0.027 


30 


1.51. 


1.52. 


PROBABILITY [CHAP 1 


(b) The desired probability is Р(А„ | B). Using Eq. (1.42) and the result from part (а), we obtain 


Two numbers are chosen at random from among the numbers 1 to 10 without replacement. Find 
the probability that the second number chosen is 5. 

Let A,;, i= 1, 2, ..., 10 denote the event that the first number chosen is i. Let B be the event that the 
second number chosen is 5. Then by Eq. (1.44), 


10 


P(B) = », P(B| AJP(A) 
i=l 


Now Р(А,) = 15. P(B} Aj) is the probability that the second number chosen is 5, given that the first is i. If 
i = 5, then P(B| А) = 0. If i 4 5, then P(B| Aj) = 4. Hence, 


10 


P(B) = У P(B| A)P(A) = 905075) = тб 


Consider the binary communication channel shown іп Fig. 1-15. The channel input symbol Х 
may assume the state 0 or the state 1, and, similarly, the channel output symbol Y may assume 
either the state 0 or the state 1. Because of the channel noise, an input 0 may convert to an 
output 1 and vice versa. The channel is characterized by the channel transition probabilities po, 
40, Pi, and q,, defined by 


Po = P(yil xo) and Py = P(yolxi) 
Go = P(yolxo) and qı = P(yilxi) 
where хо and x, denote the events (X = 0) and (X = 1), respectively, and yọ and y, denote the 
events (Y = 0) and (Y = 1), respectively. Note that po + qo = 1 = p, + q,. Let Р(х) = 0.5, po = 
0.1, and p, = 02. 
(a) Find P(yo) and P(y,). 
(b) Шад was observed at the output, what is the probability that a 0 was the input state? 
(c) Ifa 1 was observed at the output, what is the probability that a 1 was the input state? 
(d) Calculate the probability of error P,. 


40 


(а) We note that 
P(x,) = 1 — Р(х) = 1— 0.5 = 0.5 
P(yg|xg) = 4 = 1 — po = 1 - 0.1 = 09 
Р(у (x) =4, = 1-р = 1-02 = 0.8 
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Using Eq. (1.44), we obtain 
P(yo) = Р(уо| xo)P(xo) + Р(у | xi)P(xi) = 0.9(0.5) + 0.2(0.5) = 0.55 
P(y,) = Р(у, | xo)P(xo) + P(y, | xi) P(x,) = 0.1(0.5) + 0.8(0.5) = 0.45 
(b) Using Bayes’ rule (1.42), we have 


Р(хо)Р(уо| Xo) _ (0.5)(0.9) 


P(o) oss = 0818 





P(xo| yo) = 


(c) Similarly, 


Plxi)P(yi 1x1) _ (0.50.8) 
P(y,) 045 





P(x, | yi) = 


(d) The probability of error is 
P, = Р(у, | хо)Р(хо) + P(yg| xy)P(x) = 0.1(0.5) + 0.2(0.5) = 0.15. 


INDEPENDENT EVENTS 
1.53. Let A and B be events in a sample space S. Show that if A and B are independent, then so are (a) 
A and B, (b) A and B, and (c) A and B. 
(a) From Eq. (1.64) (Prob. 1.23), we have 
P(A) = P(A ^ B) + P(A ^ B) 
Since A and B are independent, using Eqs. (/.46) and (1.25), we obtain 
P(A ^ В) = P(A) — P(A ^ В) = P(A) — P(A)P(B) 
= P(A)[1 — Р(В)] = P(A)P(B) (1.84) 
Thus, by definition (1.46), A and B are independent. 
(b) Interchanging A and B in Eq. (1.84), we obtain 
P(B ^ Д) = P(B)P(A) 
which indicates that A and B are independent. 
(c) We have 
P(A ^ В) = P((A o В)] (Eq. (1.14)] 
| — P(A о B) [Eq. (1.25)] 
L— PIA) — PB) + P(A B) (Eq. (1.29)] 
| — P(A) — P(B) + P(A)P(B) [Eq. (1.46)] 
1 — P(A) — P(B 1 — P(A)) 
-[1- PD — Р(В)] 
= P(A)P(B) (Eq. (/.25)] 
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Hence, 4 апа B аге independent. 


1.54. Let А and B be events defined in a sample space S. Show that if both P(A) and P(B) are nonzero, 
then events A and B cannot be both mutually exclusive and independent. 


Let A and B be mutually exclusive events and P(A) #0, P(B) #0. Then P(A ^ B) = P(Ø) = 0 but 
P(A)P(B) # 0. Since 


P(A ^ B) x P(A)P(B) 


A and B cannot be independent. 


1.55. Show that if three events A, B, and С are independent, then A and (B о C) are independent. 
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We have 
P[A ^ (B о Cy] = PHA ^ В) о (А ^ C)] [Eq. (1.12)] 
= Р(А с\ B)-P(AnC)-PAnBncC)  [Eq.(129)] 
= P(A)P(B) + P(A)P(C) — P(A)P(B)P(C) (Eq. (1.50)] 
= P(A)P(B) + Р(А)Р(С) — P(A)P(B n С) (Eq. (1.50)] 
= P(A)[P(B) + P(C) — P(B ^ С)] 
= P(A)P(B o С) (Eq. (1.29)] 


Thus, А and (B \› C) are independent. 


Consider the experiment of throwing two fair dice (Prob. 1.31). Let A be the event that the sum 
of the dice is 7, B be the event that the sum of the dice is 6, and C be the event that the first die is 
4. Show that events A and С are independent, but events В and С are not independent. 

From Fig. 1-3 (Prob. 1.5), we see that 


A = ie Cass S34 баз» 652 Ser} 
B = (015, 62а, Саз» Sars Ssh 
C = {bar Gans баз» Caas Cas бав) 


and ANC= {S43} BoCs= {Car} 
Now Р(А) = 6 = 1 P(B) = X ВС) = 5 = & 
апа P(A ^ С) = gg = Р(А)Р(С) 


Thus, events A апа С аге independent. But 
P(B ^ С) = 3s # P(B)P(C) 


Thus, events B and C are not independent. 


In the experiment of throwing two fair dice, let A be the event that the first die is odd, B be the 
event that the second die is odd, and C be the event that the sum is odd. Show that events A, B, 
and C are pairwise independent, but A, B, and C are not independent. 


From Fig. 1-3 (Prob. 1.5), we see that 
P(A) = P(B) = PC = 38 =} 
P(A ^ В) = PA n С) = Р(В п С)=з = 1 
Thus P(A ^ В = 4 = P(A)P(B) 
P(A ^ С) = = Р(А)Р(С) 
P(B n С) = į = P(B)P(C) 


which indicates that A, B, and C are pairwise independent. However, since the sum of two odd numbers is 
even, {A n B ^ С) = Ø and 


P(A су BO С) = 0 = § = P(A)P(B)P(C) 


which shows that A, B, and C are not independent. 


A system consisting of n separate components is said to be a series system if it functions when all 
n components function (Fig. 1-16). Assume that the components fail independently and that the 
probability of failure of component i is pj, i= 1, 2, ..., n. Find the probability that the system 


functions. 


Fig. 1-16 Series system. 
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Let A; be the event that component s; functions. Then 
P(A) = 1 — P(A) = 1 — р, 


Let A be the event that the system functions. Then, since A;'s are independent, we obtain 


n n 


P(A) = e( (\a))= [] P(A) = 01 — p) (1.85) 
ї= 1 iz] 


i= i=] 


A system consisting of n separate components is said to be a parallel system if it functions when 
at least one of the components functions (Fig. 1-17). Assume that the components fail indepen- 
dently and that the probability of failure of component i is pj, i = 1, 2, ..., n. Find the probabil- 
ity that the system functions. 





Fig. 1-17 Parallel system. 


Let A; be the event that component s; functions. Then 
Р(А) = pi 


Let A be the event that the system functions. Then, since 4,5 are independent, we obtain 


навал): Г», (1.86) 
{= 1 i=l 


Using Eqs. (1.85) and (1.86), redo Prob. 1.34. 


From Prob. 1.34, р; = 5, i= 1, 2, 3, 4, where p, is the probability of failure of switch s;. Let A be the 
event that there exists a closed path between a and b. Using Eq. (1.86), the probability of failure for the 
parallel combination of switches 3 and 4 is 

Рза = рз Pa = (300) = 4 
Using Eq. (1.85), the probability of failure for the combination of switches 2, 3, and 4 is 





Рза=1—(1—(1—-1)=1—-=% 
Again, using Eq. (1.86), we obtain 


P(A) = 1 ~ рур = 1 0 1-6 В 


A Bernoulli experiment is a random experiment, the outcome of which can be classified in but 
one of two mutually exclusive and exhaustive ways, say success or failure. A sequence of Ber- 
noulli trials occurs when а Bernoulli experiment is performed several independent times so that 
the probability of success, say p, remains the same from trial to trial. Now an infinite sequence of 
Bernoulli trials is performed. Find the probability that (a) at least 1 success occurs in the first л 
trials; (b) exactly k successes occur in the first n trials; (c) all trials result in successes. 


(a) In order to find the probability of at least 1 success in the first n trials, it is easier to first compute the 
probability of the complementary event, that of no successes in the first n trials. Let 4; denote the event 
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of a failure on the ith trial. Then the probability of no successes is, by independence, 
P(A, ^ A; At A А,) = P(A, P(A) «++ P(A,) = (1 — р)" (1.87) 
Hence, the probability that at least 1 success occurs in the first n trials is 1 — (1 — р)". 


(b) In any particular sequence of the first n outcomes, if k successes occur, where К = 0, 1, 2, ..., n, then 


n — k failures occur. There are (7) such sequences, and each опе of these has probability p'(1 — р)“. 


k 
Thus, the probability that exactly k successes occur in the first n trials is given by (bea — р)", 


(c) Since A; denotes the event of a success on the ith trial, the probability that all trials resulted in 
successes in the first n trials is, by independence, 
P(A, 0 A, 0-0 A) = Р(А,)Р(А,) «++ P(A,) = р" (1.88) 


Hence, using the continuity theorem of probability (/.74) (Prob. 1.28), the probability that all trials 
result in successes is given by 


eĝa) = (tim a à) = lim йа) = limp" = t Á < | 


n>x i=l no nox 


Let S be the sample space of an experiment and S — (A, B, C], where P(A) — p, P(B) — q, and 
P(C) = r. The experiment is repeated infinitely, and it is assumed that the successive experiments 
are independent. Find the probability of the event that A occurs before B. 


Suppose that A occurs for the first time at the nth trial of the experiment. If A is to have occurred 
before B, then C must have occurred on the first (n — 1) trials. Let D be the event that A occurs before B. 
Then 


where D, is the event that C occurs on the first (n — 1) trials and A occurs on the nth trial. Since D,'s are 
mutually exclusive, we have 


Since the trials are independent, we have 


P(D,) = [Р(С)]' P(A) = т" 1р 





Thus, PD) Y p pap y РР 
ns 0) lr P pl l-r p+q 
_ P(A) 
or P(D) = PAY PB) (1.89) 


sincep+q+r=l. 


In a gambling game, craps, a pair of dice is rolled and the outcome of the experiment is the sum 
of the dice. The player wins on the first roll if the sum is 7 or 11 and loses if the sum is 2, 3, or 12. 
If the sum is 4, 5, 6, 8, 9, or 10, that number is called the player’s “point.” Once the point is 
established, the rule is: If the player rolls a 7 before the point, the player loses; but if the point is 
rolled before a 7, the player wins. Compute the probability of winning in the game of craps. 


Let A, B, and C be the events that the player wins, the player wins on the first roll, and the player gains 
point, respectively. Then P(A) = P(B) + P(C). Now from Fig. 1-3 (Prob, 1.5), 


P(B) = P(sum = 7) + Psum = 11)=%4+4%=% 
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Let A, be the event that point of k occurs before 7. Then 
P(C) — Y P(A,)P(point = k) 


kels, 5. 6, 8. 9, 10) 


By Eq. (1.89) (Prob. 1.62), 








PA) P(sum = k} 1.90) 
^ Pisum = k) + P(sum = 7) u. 
Again from Fig. 1-3, 
Pisum = 4) = $ P(sum = 5) – X P(sum = 6) — $ 
P(sum = 8) = $ P(sum = 9) = $ P(sum = 10) = X 
Now by Eq. (1.90), 
Р(А4) = 5 PIA) = à P(Ag = d 
P(Ag) = тг P(A) = $ Р(А 0) = 4 


Using these values, we obtain 


Supplementary Problems 


Consider the experiment of selecting items from a group consisting of three items (a, b, с}. 


(a) Find the sample space S, of the experiment in which two items are selected without replacement. 


(b) Find the sample space S, of the experiment in which two items are selected with replacement. 
Ans. (a) S, = (ab, ac, ba, be, ca, cb] 

(b) S, laa, ab, ас, ba, bb, be, ca, cb, cc] 
Let A and B be arbitrary events. Then show that А c B if and only if A ùu B =B. 


Hint: Draw a Venn diagram. 


Let A and B be events in the sample space S. Show that if A c B, then B c A. 


Hint: Draw a Venn diagram. 


Verify Eq. (1.13). 


Hint: Draw a Venn diagram. 


Let A and B be any two events in S. The difference of B and A, denoted by B — A, is defined as 
B-A-BnA 
The symmetric difference of A and B, denoted by A A B, is defined by 
A A B=(A – В) о (B -- А) 
Show that 
AAB=(Au В) а (А с\ В) 


Hint: Draw а Venn diagram. 
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1.69. 


1.70. 


1.71. 


1.72, 


1.73. 


1.74. 


1.75. 


1.76. 


1.77. 


1.78. 


PROBABILITY [CHAP 1! 


Let A and B be any two events in S. Express the following events in terms of A and B. 
(а) At least one of the events occurs. 


(b) Exactly one of two events occurs. 


Ans. (а) А о B;(D AA В 


Let A, B, and С be any three events in S. Express the following events in terms of these events. 


(a) Either B or C occurs, but not A. 
(b) Exactly one of the events occurs. 
(c) Exactly two of the events occur. 


Ans. (а) AN(BUC) 

b) {Ал (Во Су о {в л (Ао Су {Ca (A vu B) 

() (Ас Вус Сү (Ас Сус BYU (Вс Сус А) 

A random experiment has sample space S = (a, b, c). Suppose that P({a, c]) = 0.75 and P({b, c)} = 0.6. 
Find the probabilities of the elementary events. 

Ans. Р(а) = 0.4, P(b) = 0.25, P(c) = 0.35 

Show that 

(à РД о B)=1— РА г В) 

(b P(A ^ В) > 1 — P(A) — P(B) 

(c) P(A A B) = P(A о B) — P(A ^ B) 


Hint: (a) Use Eqs. (1.15) and (1.25). 
(b) Use Eqs. (1.29), (1.25), and (1.28). 
(c) See Prob. 1.68 and use axiom 3. 


Let А, B, and С be three events in S. If P(A) = P(B) = 1, Р(С) = 4, P(A ^ В) = 4, P(A ^ C) = 1, and 
P(B ^ C) = 0, find PA o B o C) 


Ans. 1i 


Verify Eq. (1.30). 


Hint: Prove by induction. 


Show that 
P(A, ^A; с А, > P(A) + P(A) ++ Р(А,) – (п — 1) 
Hint: Use induction to generalize Bonferroni's inequality (1.63) (Prob. 1.22). 
In an experiment consisting of 10 throws of a pair of fair dice, find the probability of the event that at least 
one double 6 occurs. 


Ans, 0.246 


Show that if P(A) > P(B), then P(A | B) > P(B| A). 

Hint: Use Eqs. (1.39) and (1.40). 

An urn contains 8 white balls and 4 red balls. The experiment consists of drawing 2 balls from the urn 
without replacement. Find the probability that both balls drawn are white. 


Ans. 0.424 
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1.79. 


1.80. 


1.81. 


1.82. 


There are 100 patients in a hospital with a certain disease. Of these, 10 are selected to undergo a drug 
treatment that increases the percentage cured rate from 50 percent to 75 percent. What is the probability 
that the patient received a drug treatment if the patient is known to be cured? 


Ans. 0.143 

Two boys and two girls enter a music hall and take four seats at random in a row. What is the probability 
that the girls take the two end seats? 

Ans i 

Let A and B be two independent events in S. It is known that P(A ^ В) = 0.16 and P(A v B) = 0.64. Find 
P(A) and P(B). 

Ans, P(A) = P(B) = 0.4 

The relay network shown in Fig. 1-18 operates if and only if there is a closed path of relays from left to 


right. Assume that relays fail independently and that the probability of failure of each relay is as shown. 
What is the probability that the relay network operates? 


Ans. 0.865 





Fig. 1-18 


Chapter 2 





Random Variables 


21 INTRODUCTION 


In this chapter, the concept of a random variable is introduced. The main purpose of using a 
random variable is so that we can define certain probability functions that make it both convenient 
and easy to compute the probabilities of various events. 


22 RANDOM VARIABLES 
A. Definitions: 


Consider a random experiment with sample space S. A random variable X(C) is а single-valued 
real function that assigns a real number called the value of X(Q) to each sample point ¢ of 5. Often, we 
use a Single letter X for this function in place of X(¢) and use r.v. to denote the random variable. 

Note that the terminology used here is traditional. Clearly a random variable is not a variable at 
all in the usual sense, and it is a function. 

The sample space S is termed the domain of the r.v. X, and the collection of all numbers [values 
of X({)] is termed the range of the r.v. X. Thus the range of X is a certain subset of the set of all real 
numbers (Fig. 2-1). 

Note that two or more different sample points might give the same value of X(Q), but two differ- 
ent numbers in the range cannot be assigned to the same sample point. 


xX) R 


Fig. 2-1 Random variable X as a function. 


EXAMPLE 2.1 [n the experiment of tossing a coin once (Example 1.1), we might define the r.v. X as (Fig. 2-2) 
X(H) = 1 X(T)=0 
Note that we could also define another r.v., say Y or Z, with 


Y(H) 20, Y(T)=1 or Z(H)=0,2(T)= 


B. Events Defined by Random Variables: 
If X is a r.v. and x is a fixed real number, we can define the event (X = x) as 
(X = х) = (6: X(0) = x} (2.1) 
Similarly, for fixed numbers x, x,, and x;, we can define the following events: 


(X x x) = (€: X(0 <x} 
(X > x) = (€ X(0 > x} (2.2) 
(x, < Х< х) = {6х < XQ) x xl 
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0 1 R 


Fig. 2-22 One random variable associated with coin tossing. 


These events have probabilities that are denoted by 
Р(Х = x) = P(t: X(Q) = x} 
P(X <x) = P(t: X( < x} 
P(X > x) = P(t: X(Q) > x} 
P(x, < X < x2) = P{E: x, < Хб) x х) 


У Чч 


(2.3) 


EXAMPLE 2.2 In the experiment of tossing a fair coin three times (Prob. 1.1), the sample space 5, consists of 
eight equally likely sample points S, = { ННН, ..., TTT}. If X is the r.v. giving the number of heads obtained, find 
(a) Р(Х = 2); (b) P(X < 2). 


(a) Let A c S, be the event defined by X = 2. Then, from Prob, 1.1, we have 
A —(X = 2) = (C: X(0 22) = (HHT, HTH, THH} 
Since the sample points are equally likely, we have 
P(X = 2) = P(A) = $ 
(b) Let B c S, be the event defined by X < 2. Then 
В=(Х < 2) = {0: X) < 2} = (HTT, ТНТ, TTH, TTT} 
and P(X < 2) = Р(В) = $ = 5 


23 DISTRIBUTION FUNCTIONS 
A. Definition: 
The distribution function [or cumulative distribution function (cdf)] of X is the function defined by 
Fy(x) = P(X < x) -0 <X< о (2.4) 


Most of the information about a random experiment described by the r.v. X is determined by the 
behavior of F g(x). 


B. Properties of Fx): 


Several properties of F у(х) follow directly from its definition (2.4). 


1. 0< Fx) <1 (2.5) 
2. Fy(x,) € Ех.) if x, < x; (2.6) 
3. lim Fy(x) = Fy(co) = 1 (2.7) 
4. lim Fy(x) = Fy(—00)20 (2.8) 
5. lim Fy(x) = Еа?) = Fa) at= lm а+& (2.9) 


x>at 0«c20 
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Property 1 follows because F,(x) is a probability. Property 2 shows that F(x) is a nondecreasing 
function (Prob. 2.5). Properties 3 and 4 follow from Eqs. (1.22) and (1.26): 
lim P(X < x) = P(X < oo) = Р($) = 1 


lim P(X < x)= Р(Х € —оо) = Р(@)=0 


x>- 0 


Property 5 indicates that F,(x) is continuous on the right. This is the consequence of the definition 
(2.4). 


Table 2.1 
x Fy(x) 
0 (TTT) i 
1 (TTT, TTH, THT, HTT} #= 1 
2 {TTT, ТТН, ТНТ, HTT, HHT, HTH, THR} i 
3 S 1 
4 S 1 





EXAMPLE 2.3 Consider the г.у. X defined in Example 2.2. Find and sketch the cdf F(x) of X. 

Table 2.1 gives Fy(x) = Р(Х < x) for x = — 1,0, 1, 2, 3, 4. Since the value of X must be an integer, the value of 
F(x) for noninteger values of x must be the same as the value of Fy(x) for the nearest smaller integer value of x. 
The F(x) is sketched in Fig. 2-3. Note that Fy(x) has jumps at x = 0, 1, 2, 3, and that at each jump the upper value 
is the correct value for F(x). 


Fx 





-l 0 | 2 3 4 x 


Fig. 2-3 


C. Determination of Probabilities from the Distribution Function: 


From definition (2.4), we can compute other probabilities, such as P(a < X < b), P(X > a), and 
P(X < b) (Prob. 2.6): 


Pla < X € b) = Fy(b) — F x(a) (2.10) 
P(X > a) = 1 — Fy(a) (2.11) 
P(X < b) = Fy(b^) b= lim b—e (2.12) 
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2.4 DISCRETE RANDOM VARIABLES AND PROBABILITY MASS FUNCTIONS 
A. Definition: 


Let X be a r.v. with cdf Fy(x). If F(x) changes values only in jumps (at most a countable number 
of them) and is constant between jumps—that is, F у(х) is a staircase function (see Fig. 2-3)— then X 
is called a discrete random variable. Alternatively, X is a discrete r.v. only if its range contains a finite 
or countably infinite number of points. The r.v. X in Example 2.3 is an example of a discrete r.v. 


B. Probability Mass Functions: 


Suppose that the jumps in Fy(x) of a discrete r.v. X occur at the points ху, x2, ..., where the 
sequence may bc either finite or countably infinite, and we assume х; < x; if i <j. 
Then Fax) — Fy(x;- 1.) = P(X < x) — P(X € х,у) = P(X = x) (2.13) 
Let px(x) = Р(Х = x) (2.14) 


The function p,(x) is called the probability mass function (pmf) of the discrete r.v. X. 


Properties of p(x): 
1. O< pylxy) <1 k=1,2,... (2.15) 
2. рух) = 0 if x # x, (k = 1, 2,...) (2.16) 
3. у Px(X,) = 1 (2.17) 


The cdf F у(х) of a discrete r.v. X can be obtained by 
Fy(x) = P(X € x) = У, paxa (2.18) 


XkSx 


25 CONTINUOUS RANDOM VARIABLES AND PROBABILITY DENSITY FUNCTIONS 
A. Definition: 


Let X be a r.v. with cdf Fy(x). If Fy{x) is continuous and also has a derivative dFy(x)/dx which 
exists everywhere except at possibly a finite number of points and is piecewise continuous, then X is 
called a continuous random variable. Alternatively, X is a continuous r.v. only if its range contains an 
interval (either finite or infinite) of real numbers. Thus, if X is a continuous r.v., then (Prob. 2.18) 


Р(Х =х)=0 (2.19) 
Note that this is an example of an event with probability 0 that is not necessarily the impossible event 
In most applications, the г.у. is either discrete or continuous. But if the cdf Fy(x) of a rv. X 


possesses features of both discrete and continuous r.v.'s, then the r.v. X is called the mixed r.v. (Prob. 
2.10). 


B. Probability Density Functions: 


_ dF (x) 


Let Sx) dx 


(2.20) 


The function f,(x) is called the probability density function (pdf) of the continuous r.v. X. 
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Properties of f(x): 

1. f(x) z 0 

2. F fx(x) dx = 1 

3. fy(x)is piecewise continuous. 

4 Pla<X <Б) = [ле dx 

The cdf F(x) of a continuous r.v. X can be obtained by 


Fy(x) = Р(Х < x)= F Л аё 


By Eq. (2.19), if X is a continuous r.v., then 


P(a < X < b) = Pla < X <b) = Pax X «b)- Pla< X <b) 


= [ fx(x) dx = Fy(b) — F(a) 


2.6 MEAN AND VARIANCE 
A. Mean: 


The mean (or expected value) of a r.v. X, denoted by uy or E(X), is defined by 


У x, px) X: discrete 
By = E(X) = ^ oo 
| Х/у(х) ах -X: continuous 


B. Moment: 


The nth moment of a r.v. X is defined by 


У x px (x) X: discrete 
E(X") = k » 


x"f(x)dx X: continuous 


— 00 


Note that the mean of X is the first moment of X. 


C. Variance: 
The variance of a r.v. X, denoted by a,” or Var(X), is defined by 
ox? = Var(X) = E(LX — Е(Х)]?} 
Thus, 
Y (Xx = ux)! Pxlx,) X: discrete 


k 
oo 


Ox = 
| (x — py)? f(x) dx X: continuous 
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(2.21) 


(2.22) 


(2.23) 


(2.24) 


(2.25) 


(2.26) 


(2.27) 


(2.28) 


(2.29) 
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Note from definition (2.28) that 
Var(X) > 0 (2.30) 


The standard deviation of a r.v. X, denoted by ax, is the positive square root of Var(X). 
Expanding the right-hand side of Eq. (2.28), we can obtain the following relation: 


Var(X) = E(X*) — [E(X)]* (2.31) 


which is a useful formula for determining the variance. 


27 SOME SPECIAL DISTRIBUTIONS 


In this section we present some important special distributions. 


A. Bernoulli Distribution: 
A г.у. X is called a Bernoulli r.v. with parameter p if its pmf is given by 
pík) = Р(Х = k) = p(l ~ p) ~ k=0,1 (2.32) 
where 0 < p < 1. By Eq. (2.18), the cdf F x(x) of the Bernoulli r.v. X is given by 


0 х<0 
Е у(х) = 41-р O<x<] (2.33) 
1 х > 1 
Figure 2-4 illustrates a Bernoulli distribution. 
рух! Күз) 





Fig. 2-4 Bernoulli distribution. 


The mean and variance of the Bernoulli r.v. X are 


нх = ЕХ) = р (2.34) 
ex? = Var(X) = pil — р) (2.35) 
A Bernoulli r.v. X is associated with some experiment where an outcome can be classified as 


either a “success” or a “failure,” and the probability of a success is p and the probability of a failure is 
1 — p. Such experiments are often called Bernoulli trials (Prob. 1.61). 
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B. Binomial Distribution: 


А г.у. X is called a binomial г.у. with parameters (n, p) if its pmf is given by 


p(k) = P(X = 0) = (bra —-p*  k-0,L...,n 


п\_ n! 
k} ki(n к)! 


which is known as the binomial coefficient. The corresponding cdf of X is 


where 0 € p < | and 


Ех) = 5, (c -p'* nzx«n«l 


Figure 2-5 illustrates the binomial distribution for n = бапа p = 0.6. 


Fx 
pen 19 0.9533 
1 
! 
i4 OX | ! 
„—— 
03110 
0л 0.2765 m 
0.2 0.1866 оз 
0.1382 
el 02 
0.0169 ане? 

0004116-—--1 - L. шо. . ool | 
u i 2 à 1 5 f t s 





(a) 


Fig. 2-8 Binomial distribution with n = 6, p = 0.6. 


The mean and variance of the binomial r.v. X are (Prob. 2.28) 


Ky = ЕХ) = np 
oy? = Var(X} = np(1 — p) 
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(2.36) 


(2.37) 


(2.38) 
(2.39) 


A binomial r.v. X is associated with some experiments in which n independent Bernoulli trials are 
performed and X represents the number of successes that occur in the n trials. Note that a Bernoulli 


r.v. is just a binomial r.v. with paramcters (1, р). 


C. Poisson Distribution: 


А c.v. X is called a Poisson г.у. with parameter 2 (> 0) if its pmf is given by 


A* 
pdk) = P(X = К) = е ° и 


k=0,1,... 
The corresponding cdf of X is 


Рү(х} = е^ 


A* 
"T п< хеп +1 


n 
k-0 


Figure 2-6 illustrates the Poisson distribution for А = 3. 


(2.40) 


(2.41) 
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0.1992 
Фә 






0.0216 0.0081 






Fig. 2-6 Poisson distribution with A = 3. 


The mean and variance of the Poisson r.v. X are (Prob. 2.29) 


Hx = E(X) =A 
ay? = Var(X) = 4 


0.4232 


0.9664 


ai 


0.988 0.996) 
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(2.42) 
(2.43) 


The Poisson r.v. has a tremendous range of applications in diverse areas because it may be used 
as an approximation for a binomial r.v. with parameters (n, p) when n is large and p is small enough 


so that np is of a moderate size (Prob. 2.40). 
Some examples of Poisson r.v.'s include 


The number of telephone calls arriving at a switching center during various intervals of time 


2. The number of misprints on a page of a book 


The number of customers entering a bank during various intervals of time 





D. Uniform Distribution: 
A r.v. X is called a uniform r.v. over (a, b) if its pdf is given by 
1 
уо) = boa a<x<b 
0 otherwise 
The corresponding cdf of X is 
0 xa 
Ех) = 5 а<х <Б 
1 x > Б 


Figure 2-7 illustrates a uniform distribution. 
The mean and variance of the uniform r.v. X are (Prob. 2.31) 


a+b 
Hy = E(X) = 5 








(2.44) 


(2.45) 


(2.46) 


(2.47) 
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Fig. 2-7 Uniform distribution over (a, b). 


A uniform r.v. X is often used where we have no prior knowledge of the actual pdf and all 
continuous values in some range seem equally likely (Prob. 2.69). 


E. Exponential Distribution: 
А r.v. X is called an exponential r.v. with parameter A (> 0) if its pdf is given by 
e^  x»0 
= 2.48 
ful) n c0 (248) 
which is sketched in Fig. 2-8(a). The corresponding cdf of X is 
)]-e ^ x20 


2.49 
0 x«0 ( 


F x) = | 
which is sketched in Fig. 2-8(Р). 


д Fy) 





(b) 


Fig. 2-8 Exponential distribution. 
The mean and variance of the exponential r.v. X are (Prob. 2.32) 


их = E(X) = (2.50) 


Poe ed 


oy? = Var(X) = y (2.51) 


The most interesting property of the exponential distribution is its “memoryless” property. By 
this we mean that if the lifetime of an item is exponentially distributed, then an item which has been 
in use for some hours is as good as a new item with regard to the amount of time remaining until the 
item fails. The exponential distribution is the only distribution which possesses this property (Prob. 
2.53). 
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F. Normal (or Gaussian) Distribution: 


А r.v. X is called a normal (or gaussian) r.v. if its pdf is given by 








fax) = e7 Weta?) 2.52 
x Vino (2.52) 
The corresponding cdf of X is 
Ер) = —= | eum де o Le [seem де (2.53) 
= = —_ e ! .54 
i Оло -o 2n x 


This integral cannot be evaluated in a closed form and must be evaluated numerically. It is conve- 
nient to use the function dz), defined as 








1 2 
Ф(г) = | e 9n dk (2.54) 
Vf 25 J. 
to help us to evaluate the value of F(x). Then Eq. (2.53) can be written as 
F,(x) = af? - r) (2.55) 
Note that 
Ф(—2) = 1 — Ф(2) (2.56) 


The function Ф(2) is tabulated in Table A (Appendix A). Figure 2-9 illustrates a normal distribution. 


fled Fy 








(а) (b) 


Fig. 2-9 Normal distribution. 


The mean and variance of the normal r.v. X are (Prob. 2.33) 


их = E(X) = p (2.57) 
сұ? = Var(X) = о? (2.58) 


We shall use the notation N(y; с?) to denote that X is normal with mean p and variance o?. A 
normal r.v. Z with zero mean and unit variance—that is, Z = N(0; 1}—is called a standard normal r.v. 
Note that the cdf of the standard normal r.v. is given by Eq. (2.54). The normal r.v. is probably the 
most important type of continuous r.v. It has played a significant role in the study of random pheno- 
mena in nature. Many naturally occurring random phenomena are approximately normal. Another 
reason for the importance of the normal r.v. is a remarkable theorem called the central limit theorem. 
This theorem states that the sum of a large number of independent r.v.'s, under certain conditions, 
can be approximated by a normal r.v. (see Sec. 4.8C). 
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2.8 CONDITIONAL DISTRIBUTIONS 
In Sec. 1.6 the conditional probability of an event 4 given event B is defined as 


P(A ^ B) 


P(A|B) = рву 


P(B) > 0 


The conditional cdf F x(x | B) of a r.v. X given event B is defined by 
P((X < x) n B} 
P(B) 


The conditional cdf F4(x|B) has the same properties as F,(x). (See Prob. 1.37 and Sec. 2.3) In 
particular, 


Еү(х|В) = Р(Х € x|B) = (2.59) 


Fy(— с | В) = 0 Е,Доо | В) = 1 (2.60) 

P(a < X < b| B) = Fy(b| B) — F la | В) (2.61) 
If X is a discrete r.v., then the conditional pmf p,(x, | B) is defined by 
P((X = x,) n B] 


Px(x,|B) = P(X = x,|B) = PLB) (2.62) 
If X is a continuous r.v., then the conditional pdf fy(x | B) is defined by 
dF y(x|B 
fx} В) = К (2.63) 
х 
Solved Problems 
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2.1. Consider the experiment of throwing a fair die. Let X be the r.v. which assigns 1 if the number 
that appears is even and 0 if the number that appears is odd. 


(a) What is the range of X? 
(b) Find P(X = 1) and P(X = 0). 
The sample space $ on which X is defined consists of 6 points which are equally likely: 
S = {1, 2, 3, 4, 5, 6} 
(a) The range of X is Ry = {0, 1}. 
(b) (X = 1) = {2, 4, 6}. Thus, P(X = 1) = $ = 4. Similarly, (X = 0) = {1, 3, 5}, and P(X = 0) = 4. 


2.2, Consider the experiment of tossing a coin three times (Prob. 1.1). Let X be the r.v. giving the 
number of heads obtained. We assume that the tosses are independent and the probability of a 
head is p. 


(a) What is the range of X? 
(b) Find the probabilities Р(Х = 0), P(X = 1), P(X = 2), and P(X = 3). 
The sample space S on which X is defined consists of eight sample points (Prob. 1.1): 
S = {ННН, HHT, ..., TTT} 
(а) The range of X is Ry = (0, 1, 2, 3}. 
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2.3. 


2.4. 


(b) If P(H) = p, then P(T) = 1 — p. Since the tosses are independent, we have 


P(X = 0) = P[{TTT}] =(1 — pP 

P(X = 1) = P[SHTT}] + РИТНТ)] + P[{TTH}] = 30 — р)2р 
P(X = 2) = P[{HHT}] + P[{HTH}] + P[{THH}] = 30 — pp? 
P(X = 3) = P[{HHH}] =p? 


An information source generates symbols at random from a four-letter alphabet (a, b, c, d) with 
probabilities P(a) = 4, P(b) = 4, and P(c) = P(d) = 4. A coding scheme encodes these symbols 
into binary codes as follows: 


a 0 

b 10 
c 110 
d 111 


Let X be the r.v. denoting the length of the code, that is, the number of binary symbols (bits). 


(a) What is the range of X? 

(b) Assuming that the generations of symbols are independent, find the probabilities P(X = 1), 
P(X = 2), P(X = 3), and P(X > 3). 

(a) The range of X is Ry = {1,2, 3]. 

(p P(X = 1) = P[{a}] = Pla) 
P(X = 2) = P[{b}] = Pb) = à 
P(X = 3) = P[{c, d}] = Р(с) + Pid) = 1 
P(X > 3) = P(Ø) = 0 


Consider the experiment of throwing a dart onto a circular plate with unit radius. Let X be the 

r.v. representing the distance of the point where the dart lands from the origin of the plate. 

Assume that the dart always lands on the plate and that the dart is equally likely to land 

anywhere on the plate. 

(a) What is the range of X? 

(b Find (i) P(X < a) and (ii) P(a < X < b, wherea« b x 1. 

(а) The range of X is Ry = (x:0xx« I}. 

(P) () (X < а) denotes that the point is inside the circle of radius a. Since the dart is equally likely to fall 
anywhere on the plate, we have (Fig. 2-10) 


2 
Р(Х < а) = 2 = а 
пі 


(i) (а < X < Б) denotes the event that the point is inside the annular ring with inner radius а and 
outer radius b. Thus, from Fig. 2-10, we have 
n(b? — a’) 
Е пі? 





P(a < X < b) =b — а? 


DISTRIBUTION FUNCTION 


2.5. 


Verify Eq. (2.6). 
Let x, < xz. Then (X € xj) is a subset of (X < x5); that is, (X < xi) c (X < x2). Then, by Eq. (1.27), 
we have 


Р(Х <х,)< P(X < xj or Fy(x4) < Fy(%2) 
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2.6. 


2.7. 


RANDOM VARIABLES 





Fig. 2-10 


Verify (a) Eq. (2.10); (b) Eq. (2.11); (c) Eq. (2.12). 
(a) Since(X « b) 2(X <a) о (a « X < b)and(X x a) n (a « X «x b) = Ø, we have 
P(X < Б) = P(X <a)+ Pla< X < Б) 
or ЕБ) = Еа) + Pla < X < b) 
Thus, Pla < X x b) = Fy(b) — F x(a) 
(b Since(X < a) (X > a) = Sand(X x a) n (X > a) = Qj, wc have 
P(X € a) + P(X > a) = Pi) = 1 


Thus, P(X > a)=1-— P(X < а) = 1 — Еа) 
(с) Now P(X < b) = P(lim X < b —e} = lim P(X x b — е) 
£0 е0] 
„> 0 к> 0 


= lim F4(b — е) = Fb”) 
к—*0 
£20 


Show that 
(a) P(a < X <b)= P(X = а) + Fx(b) — Еа) 
(b P(a« X « b) = Fy(b) — Fy(a) — P(X = b) 
(c) Plas X <Б) = P(X — a) + F(b) — Еда) — P(X =b) 
(a) Using Eqs. (1.23) and (2.10). we have 
PazXzb-P[(X-a)o(a«X x b)) 
= Р(Х = а} + Р(а < Х < Б) 
= Р(Х =a) + Fy(b) ~ Еа) 
(b We һауе 
P(a < X x b = P[(a < X <b) o (X =Ь)] 
= P(a« X < b)+ P(X = b) 


[CHAP 2 


(2.64) 
(2.65) 
(2.66) 
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Again using Eq. (2.10), we obtain 
P(a < X < b) = Pla« X x b) – Р(Х =b) 
= Fy(b) — F gla) — P(X = b) 
(c) Similarly, P(a < X < b) = P[(a < X < b) o (X = b)] 
= P(a x X < b) + Р(Х = Б) 
Using Eq. (2.64), we obtain 
Plas X <b) =Pla< X <Б) – Р(Х =b) 
= P(X =a) + Fy{b) — Еа) — P(X = b) 


2.8. Let X be the r.v. defined in Prob. 2.3. 
(a) Sketch the cdf F x(x) of X and specify the type of X. 
(b Find (i) P(X < 1), (ii) P(1« X x 2), (iii) P(X > 1), and (iv) P(1 x X < 2). 
(a) From the result of Prob. 2.3 and Eq. (2.18), we have 


0 x«l 
1 1<х<2 
F(x) = Р(Х < х) = 1° > 
xO) = Р(Х $3713 2<х<3 
1 x23 


which is sketched in Fig. 2-11. The r.v. X is a discrete r.v. 
(b) (i) We see that 


(i) By Eq. (2.10), 


(ii) By Eq. (2.11), 


(iv) By Eq. (2.64), 
Р(1<Х<2)=Р(Х = 1) + FQ) - Fy) =F +3-3= 


T 


FY 





Fig. 2-11 


2.9. © Sketch the cdf F x(x) of the r.v. X defined in Prob. 2.4 and specify the type of X. 


From the result of Prob. 2.4, we have 


which is sketched in Fig. 2-12. The r.v. X is a continuous r.v. 
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FAN) 


Fig. 2-12 
2.10. Consider the function given by 
0 х<0 
Е(х)=\х+%  О<х<% 
1 x24 


(a) 
(b) 


(c) 
(а) 


(Ь) 


(с) 


Sketch F(x) and show that F(x) has the properties of a cdf discussed in Sec. 2.3B. 

If X is the r.v. whose cdf is given by F(x), find (i) P(X < 4), (ii) PO < X < 4), (iii) P(X = 0), 
and (iv) P(0 « X < 1). 

Specify the type of X. 

The function F(x) is sketched in Fig. 2-13. From Fig. 2-13, we see that 0 < F(x) < 1 and F(x) is a 


nondecreasing function, F(— оо) = 0, F(oo) = 1, F(0) = 4, and F(x) is continuous on the right. Thus, 
F(x) satisfies all the properties [Eqs. (2.5) to (2.9] required of a cdf. 
(i) We have 


P(X <a)=FG)=44+4=3 


(ii) By Eq. (2.10), 
PO<X <$)=FQ)~FO)=3-4=4 
(й) By Eq. (2.12), 
P(X = 0) = P(X < 0) — P(X < 0) = ЕО) – FO-)=4-0=4 
(iv) By Eq. (2.64), 
POO < X < 1) = P(X =0)+ FG) – FO) = 5+2 5 = 3 


The г.у. Х is a mixed г.у. 


FAX) 
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2.11. Find the values of constants a and b such that 


is a valid cdf. 


To satisfy property | of Fy(x) [0 x Fy(x) < 1], we must have 0 < a < 1 and b > 0. Since b > 0, pro- 
perty 3 of Fix) [Fy(oc) = 1] is satisfied. 1t is seen that property 4 of F(x) [F4( — оо) = 0] is also satisfied. 
For 0 <a < 1 and b > 0, F(x) is sketched in Fig. 2-14. From Fig. 2-14, we see that F(x) is a nondecreasing 
function and continuous on the right, and properties 2 and 5 of F(x) are satisfied. Hence, we conclude that 
F(x) given is a valid cdf if 0 € a < 1 and b > 0. Note that if a = 0, then the r.v. X is a discrete r.v.; if a = 1, 
then X is a continuous r.v.; and if Ó < a < 1, then X is a mixed г.у. 


FQ) 





DISCRETE RANDOM VARIABLES AND PMF'S 
2.12. Suppose a discrete r.v. X has the following pmfs: 
рх(1) = 3 pxQ) = à рх(3) = 8 ржа) = $ 
(а) Find and sketch the cdf F x(x) of the r.v. X. 
(b) Find (i) P(X < 1, (ii) P(1 < X < 3), (ш) P(1 < X x 3) 
(а) By Eq. (2.18), we obtain 


0 x«l 

1 1<х<2 
Fx) = Р(Х <x) = 1i 2<х<3 

$ 3<х<4 

1 x>4 


which is sketched in Fig, 2-15. 
(b (G) By Eq, (2.12), we see that 
P(X < 1)= Fy} =0 
(üi) By Eq. (2.10), 
PO < X <3) = F3) – (0) =3 -253 
(ш) By Eq. (2.64), 


PU < X <3)=P(X = 1) + F3) – (1) =3+9-4=3 


Ni 
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Fig. 2-15 


2.13. (a) Verify that the function p(x) defined by 


(x) id | x-20612.. 
x)= 
P 0 otherwise 


is a pmf of a discrete r.v. X. 
(b) Find (i) P(X = 2), (ii) P(X x 2), (i) P(X > 1). 


(a) Itisclear that 0 < p(x) x 1 and 





Thus, p(x) satisfies all properties of the pmf [Eqs. (2.15) to (2.17)] of a discrete r.v. X. 
(b) (i) By definition (2.14), 
P(X = 2) = 0) = 34)? = & 
(ii) By Eq. (2.18), 


2 
PX <2)= Vp = Rl ++) = 9 
i=0 


(iii) By Eq. (1.25), 
PIX 21) =1- P(X =0)=1-p0)=1-}F=4 


2.44. Consider the experiment of tossing an honest coin repeatedly (Prob. 1.35). Let the r.v. X denote 
the number of tosses required until the first head appears. 


(a) Find and sketch the pmf p4(x) and the cdf F x(x) of X. 
(b) Find (i) P(1 < X < 4), (ii) P(X > 4). 
(a) From the result of Prob. 1.35, the pmf of X 3s given by 
px) = p(k) = Р(Х = к) -G к= 1, 2,... 
Then by Eq. (2.18), 


Fx) = PX < x) - Yeh = У G 
k-1 к=] 


Е (х) = 


ог 
0 x«l 
i 1zx«2 
i 2<x<3 


! п<х<п+1 


— 
| 
— 
Np 
— 
Е] 
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These functions are sketched in Fig, 2-16. 
(b (i) By Eq. (2.10), 
PU < X <4) = Р) - РИ) = 8-4 = 1 
(ii) By Eq. (1.25), 
Р(Х >4)=1-P(X < 4) = 1-Е) =1- 6 = 


рух) Fda) 





2.15. Consider a sequence of Bernoulli trials with probability p of success. This sequence is observed 
until the first success occurs. Let the r.v. X denote the trial number on which this first success 
occurs. Then the pmf of X is given by 


р(х)= Р(Х =x)=(1—p)*'p x=1,2,... (2.67) 


because there must be x — 1 failures before the first success occurs on trial x. The r.v. X defined 
by Eq. (2.67) is called a geometric r.v. with parameter p. 


(a) Show that р(х) given by Eq. (2.67) satisfies Eq. (2.17). 
(b) Find the cdf Р, (х) of X. 


(a) Recall that for a geometric series, the sum is given by 


Sars Sata o qued (2.68) 
п=0 n=l l-r 
Thus, 
2$ r= Yü- i=l = РР 
У px) p M p' p I-(—p p 
(b) Using Eq. (2.68), we obtain 
ы i- (1 — pp 
Р(Х > К) = 1 — рр = ——_—— = (1 — př 2.69 
(X >k) PE ру !p 1-ü-p (1 — p) (2.69) 
Thus, P(X <) =1- Р(Х >k)=1-(1 — р) (2.70) 
апа Ех) = Р(Х < х) = 1-(1- р) x-12.. (2.71) 


Note that the г.у. X of Prob. 2.14 is the geometric г.у. with p = +. 


2.16. Let X be a binomial r.v. with parameters (n, p). 
(a) Show that ру(х) given by Eq. (2.36) satisfies Eq. (2.17). 
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(b Find P(X > 1) Ши = 6and p = 0.1. 


(a) Recall that the binomial expansion formula is given by 


(a +b) = y (or^ (2.72) 


k=0 


Thus, by Eq. (2.36), 


Ур) = Y (ea -p *=(p+1- p=" = 
k=0 


(b) Now Р(Х > 1)=1— Р(Х = 0) – Р(Х = 1) 


1- (оог - (fones 


1 — (0.9)° — 6(0.1X0.9)5 = 0.114 


Ш 


2.17. Let X be a Poisson r.v. with parameter A. 


(a) Show that p,(x) given by Eq. (2.40) satisfies Eq. (2.17). 
(b Find P(X > 2) with 2 = 4. 


(а) By Eq. (2.40), 


x x Дк 
Ур) =e У ae ^6 = 1 
к= 0 к= 0 
(b) With A = 4, we have 
2,4 
p(k) = е “п 
2 
апа Р(Х <2)= Ур (К) 2e *(L + 4 + 8) & 0.238 
k=0 
Thus, Р(Х > 2) =1— Р(Х < 2) $1— 0.238 = 0.762 


CONTINUOUS RANDOM VARIABLES AND PDF’S 
2.48. Verify Eq. (2.19). 
From Eqs. (1.27) and (2.10), we have 
P(X =x) < Pix — e < X € x) = Fy(x) — Fy(x — е) 


for any e > 0. As F(x) is continuous, the right-hand side of the above expression approaches 0 as e — 0. 
Thus, P(X 2 x) « 0. 


2.49. The pdf of a continuous r.v. X is given by 


1 О<х<1 
Лх) = $4 1<х<2 
0 otherwise 


Find the corresponding cdf F y(x) and sketch /у(х) and F(x). 
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By Eq. (2.24), the cdf of X is given by 
0 x <0 


[14-3 0zx«l 
o 
1 x 
Fy(x) = [sas [547-3 1<х<2 
о 1 


1 2 
[за [зат 2<x 
о H 


The functions f(x) and F(x) аге sketched in Fig. 2-17. 


fx) 





Fig. 2-17 
2.20. Let X be a continuous r.v. X with pdf 
kx O<x<l 
Sx) = f otherwise 


where k is a constant. 

(a) Determine the value of k and sketch fy(x). 

(b) Find and sketch the corresponding cdf F у(х). 
(c) Find PG < X <2). 

(a) By Eq. (2.21), we must have k > 0, and by Eq. (2.22), 


[е dx = ; = 1 
Thus, k = 2 and 
2x 0«x«l 
xe) = t otherwise 
which is sketched in Fig. 2-18(a). 
(b) By Ea. (2.24), the cdf of X is given by 
0 x«0 


Ех) = [tato 0zx«l 
o 


[azai I <x 
o 


which is sketched in Fig. 2-18(b). 
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fx) Fux) 





(c) By Eq. (2.25), 
P < X < 2) = Р) – Fah) = 1- 0) = 1 


2.21. Show that the pdf of a normal r.v. X given by Eq. (2.52) satisfies Eq. (2.22). 
From Eq. (2.52), 





a 
e Tead qx 


” 1 
dx = 
[io t Vf 2nd J y 


Let у = (x — u)/(/2 a). Then dx = 4/20 dy and 
1 


a 1 © 
е 670020202) dy = — f e^? dy 
/ 200 [Г Jn — 


Let | e ду = 1 


Then 12 = [| ev ax] | e? | - f f e 91 *»5 dx ду 


Letting x = гсоѕ д and y = rsin @ (that is, using polar coordinates), we have 


2m oo ec 
r= | [ ет" dr 9-2 | e "rdr—-n 
0 0 o 





Thus, l= f e dy = n (2.73) 
and Г fx(x) dx D^ e^? dy І Jn 1 
= — = — п = 
= о х Jn -o Jn 


2.22. Consider a function 


1 
f(x) = ete —o0«x«oo 
т 


Find the value of a such that f(x) is a pdf of a continuous r.v. X. 
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Д(х) = l еб х2 +х-а) u 1 g stn х+\!Ф&+а- 1/4) 
л л 


-|- enm poene 
Jn 


If f (x) is a pdf of a continuous r.v. X, then by Eq. (2.22), we must have 


© x 1 
| fhe) dx= em | —e 0 MP fy = | 


-a л 


1 
Now by Eq. (2.52), the pdf of NG; 1) is — e ^ !?", Thus, 
л 


| = е 0 M dy = |] and | f(x) dx = eeu — | 


-H л х, 


from which we obtain а = 4. 


2.23. A r.v. X is called a Rayleigh г.у. if its pdf is given by 


Xo x2/1202) 
— e х> 0 

Лх) = {о? (2.74) 
0 х< 0 


(a) Determine the corresponding cdf F(x). 
(b) Sketch f(x) and Fy(x) ого = 1. 


(a) By Eq. (2.24), the cdf of X is 
Ех) = | B е 100 дЕ x > 0 
oc 


Let y = €7/(267), Then dy = (1/с2) d£, and 


x2/(2a2) 
F(x) = | e! dy = 1 — e720?) (2.75) 
о 
(b) With o = 1, we have 

xe 2 х> 0 
лбх) = lo x «0 
|1—e 7? x20 
and F(x) = n x<0 


These functions are sketched in Fig. 2-19. 


224. A r.v. X is called a gamma r.v. with parameter (a, 4) (x > O and å > 0) if its pdf is given by 
-= Ах, a-l 
Ae “*(Ax) x20 


m=) т u 226) 


where Г(а) is the gamma function defined by 


го = | ex"! dx a0 (277) 
0 
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ло Ру) 
0.6 l 
0.8 
0.4 
0.6 
04 
02 
02 
0 0 
0 ! 2 3 x 0 l 2 


(a) | (5) 
Fig. 2-19 Rayleigh distribution with e = 1. 


(a) Show that the gamma function has the following properties: 
1. I(a-4 1)-aI(o) а> 0 
2. Ц(к+ 1) = К! k (20): integer 
3. T) = n 

(b) Show that the pdf given by Eq. (2.76) satisfies Eq. (2.22). 

(c) Plot f(x) for (a, 4) = (1, 1), (2, 1), and (5, 2). 

(a) Integrating Eq. (2.77) by parts (u = x*~', dv = e^ * dx), we obtain 


T(a) = —e *x*! 





«[ e^*(a — 1)x*^? dx 
о 0 


= (a — 1) [ev dx = (x — 1)Г(« — 1) 
0 


Replacing a by « + 1 in Eq. (2.81), we get Eq. (2.78). 
Next, by applying Eq. (2.78) repeatedly using an integral value of a, say a = k, we obtain 


T(k + 1) = kT (ky = kk — Yk — 1) = k(k — 1) + QT (1) 
Since Г(1) = [e dx = 1 
о 
it follows that T(k + 1) = k!. Finally, by Eq. (2.77), 
Г(2) = [ex dx 
Let y = x!?, Then dy = ix^"? dx, and 
Г(2) = 2 [er dy = [Г e^? dy = n 
О -w 


in view of Eq. (2.73). 
(b) Now 


e rye! dx 





© де "(xy ! Ae [ 


LL JO) dx -Í Ta “Teh 


Let y = Ax. Then dy = 4 dx and 


Г ло) d aa ery! dy - La ra) = 
Eh: Ta 
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(2.78) 
(2.79) 
(2.80) 


(2.81) 
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f» 





2 3 


Fig. 2-20 Gamma distributions for selected values of « and 4. 


(c) The pdf's f(x) with (a, А) = (1, 1), (2, 1), and (5, 2) are plotted in Fig. 2-20. 
Note that when х = 1, the gamma r.v. becomes an exponential r.v. with parameter 4 [Eq. (2.48)]. 
MEAN AND VARIANCE 


2.25. Consider a discrete r.v. X whose pmf is given by 


1 
_ 33 х = — 1, 0, 1 
ру) = b otherwise 
Plot px(x) and find the mean and variance of X. 


(b) Repeat (a) if the pmf is given by 


(x) = i x = —2,0,2 
PRI lo otherwise 
The pmf p,(x) is plotted in Fig. 2-21(a). By Eq. (2.26), the mean of X is 


их = К(Х)=+(—1+0+1)=0 
Ву Eq. (2.29), the variance of X is 


oy? = Var(X) = EX — њу) = E(X?) = MC- D? + 0) + (01 = 3 
(b) The pmf p,({x) 1s plotted in Fig. 2-21(b). Again by Eqs. (2.26) and (2.29), we obtain 


их = E(X) = 4-2 +0 + 2) =0 


Рух) Рух) 
| П 
3 3 
-à -1 0 | 2 x -2 -l 0 1 2 x 
(a) (b) 


Fig. 2-21 
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2.26. 


2.27. 


2.28. 
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oy? = Var(X) = (72)? + (07 + Q?] = 8 


Note that the variance of X is a measure of the spread of a distribution about its mean. 


Let a r.v. X denote the outcome of throwing a fair die. Find the mean and variance of X. 
Since the die is fair, the pmf of X is 
px(x) = рх(Ю) = 6 К=1,2,...,6 
By Eqs. (2.26) and (2.29), the mean and variance of X аге 


ux = E(X) = 4(1 + 243444546 = 5 = 3.5 
ex! =i — 9? + (2-97 + (3 – 3) +(4-FP +6 – 3) +6-FPI=H8 


Ni 


Alternatively, the variance of X can be found as follows: 
EX?) = 1 22 632 + 42 + 52 +6?) = 4 
Hence, by Eq. (2.31), 
ey! = E(X?) - [Е(Х)р = 4 - GY = 13 


Find the mean and variance of the geometric r.v. X defined by Eq. (2.67) (Prob. 2.15). 


To find the mean and variance of a geometric r.v. X, we need the following results about the sum of a 
geometric series and its first and second derivatives. Let 











gir) = Xat'-—— — qne (2.82) 
п=0 l-r 
„у dg = DE: 
Then g(r) = de У. ат =т=? (2.83) 
" dur = -a 2a 
90) = 2 = „2. атп ир (2.84) 


By Eqs. (2.26) апа (2.67), and letting q = 1 — р, the mean of Х is given by 
by = EX) = Y xq pe з= 5 =- (2.85) 


where Eq. (2.83) is used with a = pandr = q. 
To find the variance of X, we first find E[ X(X — 1)]. Now, 














E[X(X — 0] = È xix — Uq*^'p = Y pax(x – 0q*7? 
x-l x-2 
2pq 2р4 2q Xl — p) 
- - 23202 (2.86) 
(0-9? р р? р? 
where Eq. (2.84) is used with a = pq and r = q. 
Since E[X(X — 1)] = E(X? — X) = E(X?) — E(X), we have 
Al—p) 1! 2— 
E(X?) = ELX(X — D) + кх) = UO L1 22 77 (2.87) 
p p p 
Then by Eq. (2.31), the variance of X is 
2- 1 t- 
oy? = Van X) = F(X?) - [EGO]! = = -3 z (2.88) 


Let X be a binomial r.v. with parameters (n, p). Verify Eqs. (2.38) and (2.39). 
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By Eqs. (2.26) and (2.36), and letting q = | — p, we have 


E(X) = Y kpy(k) = XM - 
k=0 k=0 


n! 
n (n — k)!k! 
ш (n — 1)! 


=" mooki” 


коп 


pq 


k-14n—k 


Letting i = k — 1 and using Eq. (2.72), we obtain 


c (n-1y 
E(X) = "Y (пт 


= у, (" » eis 
i=0 1 


= np(p + q^! = пр(1)' 7! = пр 


Pq 











Next, E[X(X - 10] = y kk — Lp x(k) = Sak - І (е -* 
k=0 
ч k(k n! konk 
= Ук 0 ева? 4 
_ а (n — 2)! Àh-2,n—-k 
Snn- DE Oka 7 
Similarly, letting i = k — 2 and using Eq. (2.72), we obtain 
- 2)! | 
E[X(X — 1] = n(n — 1)р? bi u реса 


(82—011 
= n(n – bey, ( i Det 
= n(n — рр + q^? = n(n — Up? 
Thus, E(X?) = E[X(X — 1)] + E(X) = піп — Up? + np (2.89) 
and by Eq. (2.31), 
ey? = Var(x) = n(n — Up? + np — (np)? = np(1 — p) 


2.29. Let X be a Poisson r.v. with parameter 4. Verify Eqs. (2.42) and (2.43). 
By Eqs. (2.26) and (2.40), 





op k 
P = — = -À 
E(X) D kpy(k) = le 0+ x KD 
a g 
= Ae? e^ едо = 4 
o K-D DE 
ou Nu i Ako? 
— = — — = 257 
Next, E[X(X — 10 È kk tet £ = Pe Lamm LEA 


i 


= еу А елор 
izol 


Thus, E(X?) = ELX(X — D] + EX) = А2 +4 (2.90) 
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2.30. 


2.31. 


2.32. 
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and by Eq. (2.31), 
oy? = Var(X) = EX?) - [EX) = à2 44-22 =a 





Find the mean and variance of the r.v. X of Prob. 2.20. 
From Prob. 2.20, the pdf of X is 


2x O<x<! 
0 otherwise 


/х(х) = { 


By Eq. (2.26), the mean of X is 


1 хэ 
uy = E(X) = | xQx) dx = 2— 


0 


1 
—2 
= 3 





By Eq. (2.27), we have 


1 


[| 
Nie 


1 x4 
E(X?) = | x'(2x) dx = 2— 
o 4 





0 


Thus, by Eq. (2.31), the variance of X is 
oy? = Var(X) = EX?) - [EF = 3 - (8? = ds 


Let X be a uniform r.v. over (a, b). Verify Eqs. (2.46) and (2.47). 
By Eqs. (2.44) and (2.26), the mean of X is 


b 1 ) x? 


= ЕХ) = 
Bx (x) [5 “Spa? 








а 





Ву Eq. (2.27), we have 
1 3 


dx = —— 


lI = Цр? b 2 
boa hoa (b? + ab + a?) 





b 
E(X?) = | х? 
Thus, by Eq. (2.31), the variance of X is 


oy? = Var(X} = E(X?) - [Е(Х)]? 
= 3Р2 + ab + a?) — ДЬ + a) = (b — ay 





b 
d 


Let X be an exponential r.v. X with parameter 4. Verify Eqs. (2.50) and (2.51). 
By Eqs. (2.48) and (2.26), the mean of X is 


их = E(X) = | хде ^ dx 


0 


Integrating by parts (u = x, dv = Ae ^ ?* dx) yields 


E(X) = – хет?" 





Next, by Eq. (2.27), 
E(X?) = | x!àe 7 dx 
О 
Again integrating by parts (u = x?, dv = de” ** dx), we obtain 


Е(Х?) = — x2e^ Ах 





E EH -à 2 
+2 xe dx => 
0 0 A 


[CHAP 2 


(2.91) 


(2.92) 
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Thus, by Eq. (2.37), the variance of X is 


2 2 | 
oy? = E(X?) — (EX) = 4 — (3) 2l 


2.33. Let X = N(u; o°). Verify Eqs. (2.57) and (2.58). 


Using Eqs. (2.52) and (2.26), we have 


иу = E(X) = 





o 
| xe 700100202) qx 


1 


Writing x as (x — и) + р, we have 








1 ao 
E(X) - | (x = pe” 69210282) dy 4. y 


Pine 


Letting y = x — ріп the first integral, we obtain 


1 © 
eg W720) qu 
/2n6 


V 





1 а о 
Е(Х) = | ye "07 dy + р | Sx) dx 


„Оло C00 
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The first integral is zero, since its integrand is an odd function. Thus, by the property of pdf Eq. (2.22), we 


get 
Hy = ЕХ) =p 
Next, by Eq. (2.29), 


oy? = E[(X — u?) = 





20 
| (х= uye € Gen dx 


l 
J2n6 d_a 
From Eqs. (2.22) and (2.52), we have 


x 
| g 95702) dy = с. /2n 
xX 


Differentiating with respect to е, we obtain 


* (х= И)? |... aue 
| =e uy 9 dx = „Рт 


Multiplying both sides by o?/ /2z, we have 
1 


ле 


Thus, ву? = Var(X) = о? 





æ 
| (x _ pyre - (x- uy} 202} dx = в? 
- х 


2.34. Find the mean and variance of a Rayleigh г.у. defined by Eq. (2.74) (Prob. 2.23). 


Using Eqs. (2.74) and (2.26), we have 


а, 


X | x _ 
x © е x2/(202) dx = — х?е x2/((2a2) dx 
в 6? Jo 


„== [ 


0 
Now the variance of N(0; с?) is given by 
1 


ine 





X 
| xlg^ P20) dy = g? 
— 906 
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Since the integrand is an even function, we have 


1 


Jno 
* л 
ог | х?е "on dx = 1n e? = 5 c? 
0 


1 л л 
Тһеп иу = E(X) = 3 E о? = f c (2.93) 





x 
| xe 910292 dy = 1g? 
о 


© 1 xX 
Next, Е(Х?) = | x? = e^ rem dy = — | x3e7 521092) dy 
0 с T Jo 
Let y = х?/(2е?). Then dy = x йх/о?, and so 
E(X?) = 2e? | ye ? dy = 20? (2.94) 
0 
Hence, by Eq. (2.31), 
oy! = E(X?) — [E(X)P = (2 - Ju = 0.4290? (2.95) 


Consider a continuous r.v. X with pdf f,(x). If f(x) = 0 for x < 0, then show that, for any a > 0, 


P(X >а) < а (2.96) 


where иу = E(X). This is known as the Markov inequality. 
From Eq. (2.23), 


Р(Х > а) = [ле dx 


Since fy(x) = 0 for x < 0, 


ux = E(X) = | рх) dx > [эл dx >a [ле dx 
o a а 


Непсе, | Syl) dx = P(X > a) s Ë 
lo a 


For any a > 0, show that 


2 
o 
PX — ux 2 a) < T (2.97) 


where иу and сх? are the mean and variance of X, respectively. This is known as the Chebyshev 
inequality. 


From Eq. (2.23), 


ux-a a: 


fx(x) dx = | Лх) dx 


Ix -ux| za 


fx (x) dx + | 


Hata 


PIX = wiz a= | 


-æ 


By Eq. (2.29), 


ox = f (x — ux falx) dx > | 


=a |x uxla 


(x — Hx f(x) dx > a? | Р(х) dx 


Ix - uxl 2а 
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2 


Hence, | Jao) dx < $ or РХ — pxl 2 a)  X- 
[х -ux| za a? a 


Note that by setting a = ko, in Eq. (2.97), we obtain 


І 
P( X — их > key) < x5 (2.98) 


Equation (2.95) says that the probability that a r.v. will fall k or more standard deviations from its mean is 
<1/k?. Notice that nothing at all is said about the distribution function of X. The Chebyshev inequality is 
therefore quite a generalized statement. However, when applied to a particular case, it may be quite weak. 


SPECIAL DISTRIBUTIONS 


2.37. 


2.38. 


2.39. 


A binary source generates digits 1 and 0 randomly with probabilities 0.6 and 0.4, respectively. 


(a) What is the probability that two 15 and three Os will occur in a five-digit sequence? 
(b) What is the probability that at least three 15 will occur in a five-digit sequence? 


(а) Let X be the r.v. denoting the number of Is generated in a five-digit sequence, Since there are only two 
possible outcomes (1 or 0), the probability of generating 1 is constant, and there are five digits, it is 
clear that X is a binomial r.v. with parameters (n, p) = (5, 0.6). Hence, by Eq. (2.36), the probability 
that two 1s and three Os will occur in a five-digit sequence is 


Р(Х = 2) = (3)(0.6)7(0.4)? = 0.23 
(b) The probability that at least three Is will occur in a five-digit sequence is 


P(X > 3)=1- P(X < 2) 


2 
where =È at Joos *= 0.317 


Hence, P(X > 3) = 1 — 0.317 = 0.683 


A fair coin is flipped 10 times. Find the probability of the occurrence of 5 or 6 heads. 


Let the r.v. X denote the number of heads occurring when a fair coin is flipped 10 times. Then X is a 
binomial r.v. with parameters (я, p) = (10, +). Thus, by Eq. (2.36), 


6 10 JI 
P(S < X <6)= -ji- = 0.451 
Osx <6) xGG 2 


Let X be a binomial r.v. with parameters (n, p), where 0 « p < 1. Show that as k goes from 0 to 
n, the pmf p,(k) of X first increases monotonically and then decreases monotonically, reaching its 
largest value when К is the largest integer less than or equal to (n + 1)р. 


By Eq. (2.36), wc have 


(Da -p * 

рх) Л (п-к Dp (299) 

px(k — 1) ( n » (prot k(l — p) . 
k-1 


Hence, py(k) > py(k — 1) if and only if (n — k + Dp > k(1 — p) or k € (n + Dp. Thus, we see that py(k) 
increases monotonically and reaches its maximum when К is the largest integer less than or equal to 
(n + Dp and then decreases monotonically. 
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2.40. Show that the Poisson distribution can be used as a convenient approximation to the binomial 


2.41. 


2.42. 


distribution for large n and small p. 
From Eq. (2.36), the pmf of the binomial r.v. with parameters (n, p) is 
n(n — 14и — 2)+--(n—k + 1) 
pits 


px(&) = (bea rq gere ~~ р)" 


Multiplying and dividing the right-hand side by и“, we have 


k! 


If we let n — co in such a way that np = A remains constant, then 


у) 

п п п пос 

А) ане 
n n n no 


where we used the fact that 





Hence, in the limit as п — oo with np = 4 (and as p = A/n > 0), 
(Deo eps PUN: ete пр = А 


Thus, in the case of large n and small р, 
k 


BON к Nik S ar = 
(rea py “xe T np=A 


which indicates that the binomial distribution can be approximated by the Poisson distribution. 


A noisy transmission channel has a per-digit error probability p = 0.01. 


(a) Calculate the probability of more than one error in 10 received digits. 
(b) Repeat (a), using the Poisson approximation Eq. (2.100). 


(2.100) 


(a) It is clear that the number of errors in 10 received digits is a binomial r.v. X with parameters (n, p) = 


(10, 0.01). Then, using Eq. (2.36), we obtain 
Р(Х >1)= 1 — Р(Х 0)— P(X = 1) 


=]— (e joon*tosoy* = (T joon'oso? 


1 
= 0.0042 
(b) Using Eq. (2.700) with А = пр = 100.01) = 0.1, we have 
P(X > 1)=1— P(X = 0) — Р(Х = 1) 
(0.1)° 2o 4 (0.1)! 
on 1! 


-1- e 91 


= 0.0047 


The number of telephone calls arriving at a switchboard during any 10-minute period is known 


to be a Poisson r.v. X with A = 2. 
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2.43. 


2.44. 


2.45. 


(a) Find the probability that more than three calls will arrive during any 10-minute period. 
(b) Find the probability that no calls will arrive during any 10-minute period. 
(a) From Eq. (2.40), the pmf of X is 

2k 


рх) = P(X =k) =e? k=0,1,... 
3 2k 
Thus, Р(Х > 3) = -F(X < 3)=1- Ye? ОП 
k=0 : 


=1-е741+2 + + 8) 20148 
(b P(X = 0) =p,(0) = е”? x 0.135 


Consider the experiment of throwing a pair of fair dice. 


(a) Find the probability that it will take less than six tosses to throw a 7. 
(b) Find the probability that it will take more than six tosses to throw a 7. 


(a) From Prob. 1.31(a), we see that the probability of throwing a 7 on any toss is $. Let X denote the 
number of tosses required for the first success of throwing a 7. Then, from Prob. 2.15, it is clear that X 
is a geometric r.v. with parameter p = $. Thus, using Eq. (2.71) of Prob. 2.15, we obtain 


P(X < 6) = Р(Х < 5) = Е,(5) = 1 — ($) = 0.598 
(b) Similarly, we get 
P(X > 6)= 1 — P(X < 6) = 1 – F,(6) 
21-[1- Q5] = (5 ~ 0.335 


Consider the experiment of rolling a fair die. Find the average number of rolls required in order 
to obtain a 6. 


Let X denote the number of trials (rolls) required until the number 6 first appears. Then X is a 
geometrical r.v. with parameter p = 4. From Eq. (2.85) of Prob. 2.27, the mean of X is given by 


Thus, the average number of rolls required in order to obtain a 6 is 6. 


Assume that the length of a phone call in minutes is an exponential r.v. X with parameter 
A = 45. If someone arrives at a phone booth just before you arrive, find the probability that you 
will have to wait (a) less than 5 minutes, and (b) between 5 and 10 minutes. 


(a) From Eq. (2.48), the pdf of X is 


de x0 


Лх) = f x<0 


Then 
5 
= ] — е795 м 0.393 


0 


5 
Р(Х < 5) = | gge? dx = —e 110 
О 





(b) Similarly, 


10 
Р(5<Х <10)= | е "9 dx 2e 95—e7' œ 0.239 
3 
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All manufactured devices and machines fail to work sooner or later. Suppose that the failure rate 

is constant and the time to failure (in hours) is an exponential r.v. X with parameter 4. 

(a) Measurements show that the probability that the time to failure for computer memory chips 
in a given class exceeds 10* hours is e ^ ! (20.368). Calculate the value of the parameter 4. 

(b) Using the value of the parameter 4 determined in part (a), calculate the time x such that 
the probability that the time to failure is less than xy is 0.05. 


(a) From Eq. (2.49), the cdf of X is given by 


ЕД) = t-e ^ х> 0 
х= 0 х< 0 
Now P(X > 104) = 1 — P(X < 10%) = 1 — F,(104) 


MOS) = 74106 Logo] 


= 1-– (1-е е 


from which we obtain А = 107+. 


(b) We want 
Ехо) = P(X < хо) = 0.05 
Hence, L — e77 = | — e 19° = 0.05 
or e 10 **o = 0,95 


from which we obtain 


ху = — 10° In (0.95) = 513 hours 


A production line manufactures 1000-ohm (Q) resistors that have 10 percent tolerance. Let X 
denote the resistance of a resistor. Assuming that X is a normal r.v. with mean 1000 and variance 
2500, find the probability that a resistor picked at random will be rejected. 


Let A be the event that a resistor is rejected. Then A = (X < 900} о (X > 1100}. Since (X < 900} ^ 
(X > 1100} = Ø, we have 


P(A) = Р(Х < 900) + P(X > 1100) = F4,(900) + [1 — Fy(1100)] 
Since X is a normal г.у. with u = 1000 and о? = 2500 (a = 50), by Eq. (2.55) and Table A (Appendix А), 


900 — 1000 
2300 = Ф 2) = 1 — Ф(2) 


1100 — 1 
F (1100) = o( en) = (2) 


Thus, P(A) = 2[1 — Ф(2)] = 0.045 


F (900) = of 


The radial miss distance [in meters (m)] of the landing point of a parachuting sky diver from the 

center of the target area is known to be a Rayleigh r.v. X with parameter o? = 100. 

(а) Find the probability that the sky diver will land within a radius of 10 m from the center of 
the target area. 

(b) Find the radius r such that the probability that X > rise ! (20.368). 


(a) Using Eq. (2.75) of Prob. 2.23, we obtain 
P(X < 10) = F,(10 = 1 – е7 190/009 = 1 — 0795 z 0,393 
(b) Now 
P(X >r)=1— Р(Х <r)= 1 — Fy(r) 


1 — (1 27721200) = 912/200 0-1 


from which we obtain r? = 200 and r = 4/200 = 14.142 т. 
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2.49, Let X be a Poisson r.v. with parameter 4. Find the conditional pmf of X given B = (X is even). 
From Eq. (2.40), the pdf of X is 


Then the probability of event B is 





» | Ak 
P(B) = Р(Х = 0, 2,4,..)= У ea 
k=even ' 
Let A = {X is odd}. Then the probability of event A is 
© AE 
P(A) = P(X = 1,3,5,..) = Yen 
kz:odd . 
Now 
20 А РЫ со РЫ © РЫ 
е^ — + ет = е^ — = елед = 1 (2.101) 
P k! P k! k=0 k! 
© РЫ © РЫ © (– 2) 
“Af -àL = = А = -À “AL -2A 2.1 2 
ee eye и" RT (2102 
Hence, adding Eqs. (2.101) and (2.102), we obtain 
а РЫ 
РВ) = Y e^ H^ ire?) (2.103) 
k=even . 
Now, by Eq. (2.62), the pmf of X given B is 
P((X =k) с B} 
k| B) = ———————— 
px(k| В) P(B) 


If k is even, (X = k) c Band (X =k) a B = (X = k). If k is odd, (X = k) à B= Ø. Hence, 
P(X =k) 2e *A* 








BB deery) = even 
Px(k| B) = Р(@) 
(В) =0 k odd 


2.50. Show that the conditional cdf and pdf of X given the event В = (a < X x b) are as follows: 


0 x<a 
_ LF x(x) — F x(a) . 
Fy(xja<X <b)= Fy(b) — Fala) a<x<b (2.104) 
1 х>Ь 
0 хха 
fdxla<X <Б) = S20... a«xxzb (2.105) 


fx(&) 46 
0 x>b 
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Substituting B = (a < X < b) in Eq. (2.59), we have 


Fla < X «bye PX < xla < X «6 e LU S9 0 (a « X s Dj 


P(a« X x b) 
e x<a 
Now (Х< х) о (а<Х< 6) = 4(0<Х <x) а<х <Б 


(а<Х <b) х> Б 


Hence, . PO) . 

Fdxla < X сс) х<а 
Р(а<Х <x) Fy(x) — Еа) 

A X b) = ——————— = b 

Fala < X = FY Sb) РАЮ ку SS 
P(a « X <b) 

› X &b)2L————- 

Fy(x|a « X <b) Plax X <b) 1 х>Ь 


By Eq. (2.63), the conditional pdf of X given a « X x b is obtained by differentiating Eq. (2.104) with 
respect to x. Thus, 


0 X<a 
Fyxla<X <) = тр а<х<Ь 
* x [леа 
0 x>b 


2.51. Recall the parachuting sky diver problem (Prob. 2.48). Find the probability of the sky diver 
landing within a 10-m radius from the center of the target area given that the landing is within 
50 m from the center of the target area. 


From Eq. (2.75) (Prob. 2.23) with о? = 100, we have 
F(x) = 1 — е 72099 


Setting x = 10 and b = 50 and a = — œ in Eq. (2.104), we obtain 





F,(10) 
0 0) = F,(10 0) = —— 
P(X <10|Х < 50) = F,(10| X < 50) F,(50) 
1 — e~ '00/200 
2.52. Let X = N(0; o°). Find E(X | X > 0) and Var(X |X > 0). 
From Eq. (2.52), the pdf of X = N(0; c?) is 
f, (x = e 20242) 
xx 2ло 
Then by Eq. (2.105), 
0 x <0 
1 
fx X >0)= Vae) = 2 —— 67521002) х> 0 (2.106) 


| “мә vane 
1 


Vinc 


Hence, E(X|X >0)=2 





9с 
xe 7207) dx 
о 
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2.53. 


Let y = x?/(20?). Then dy = x dx/c?, and we get 


2 


с |? 2 
E(X |X > 0) = | е dy = A (2.107) 
VJ 2n Jp т 
х 





1 
Эле o 


1 © 
= f x1g 9/000 ах = Var(X) = о? (2.108) 


ne 


Next, E(X?|X »50) 22 297x123) dx 








Then by Eq. (2.37), we obtain 
Var(X | X > 0) = E(X?|X > 0 - (E(X] X > oP 
2 
= ei — 2) = 0.363 a? (2.109) 


А r.v. X is said to be without memory, or memoryless, if 
P(X <x+t|X >t)= P(X <x) xt-0 (2.110) 


Show that if X is a nonnegative continuous r.v. which is memoryless, then X must be an expo- 


nential r.v. 
By Eq. (1.39), the memoryless condition (2.110) is equivalent to 


P(XEXxttX»1- . 
- pgr») 9 <x) 


or P(X <x +t, X >t) = Р(Х < х)Р(Х > 1) (2.111) 
If X is a nonnegative continuous r.v., then Eq. (2.111) becomes 
P(t < X <x+t)= P(O < X <x)P(X > t) 
or [by Eq. (2.25)], 
Fy(x + t) — Ех) = [F x(x) — FOO — Р) 
Noting that F,(0) = 0 and rearranging the above equation, we get 


Ех +9 — Fylx) Fx! — Fxo9] 
t t 





Taking the limit as t > 0, we obtain 
Ех) = FOL — Fx(x)] (2.112) 

where F(x) denotes the derivative of F у(х). Let 

R(x) = 1 — Fy(x) (2.113) 
Then Eq. (2.112) becomes 

Ку(х) = Ry(0)Ry(x) 
The solution to this differential equation is given by 

R yx) = Кек» 


where k is an integration constant. Noting that k = R,(0) = 1 and letting R(0) = —F (0) = —f4(0) = —4, 
we obtain 


Кх) = е? 
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2.55. 


2.56. 


2.57. 


2.58. 
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and hence by Eq. (2.113), 
Ех) = 1 Кух) = 1 — e7” x»0 


Thus, by Eq. (2.49), we conclude that X is an exponential r.v. with parameter 4 = /,(0) (> 0). 
Note that the memoryless property Eq. (2.110) is also known as the Markov property (see Chap. 5), and 
it may be equivalently expressed as 


P(X »х+1ї|Х >t)= Р(Х > х) x>0,t>0 (2.114) 
or P(X >x4+t) = Р(Х > x)P(X > t) х> 0,1> 0 (2.115) 


Let X be the lifetime (in hours) of a component. Then Eq. (2.114) states that the probability that the 
component will operate for at least x + г hours given that it has been operational for t hours is the same as 
the initial probability that it will operate for at least x hours. In other words, the component "forgets" how 
long it has been operating. 


Note that Eq. (2.115) is satisfied when X is an exponential r.v., since Р(Х > x) = 1 — Fy(x) =e ** and 
g MET) ы от xe at 


Supplementary Problems 


Consider the experiment of tossing a coin. Heads appear about once out of every three tosses. If this 
experiment is repeated, what is the probability of the event that heads appear exactly twice during the first 
five tosses? 


Ans. 0.329 
Consider the experiment of tossing a fair coin three times (Prob. 1.1). Let X be the r.v. that counts the 


number of heads in each sample point. Find the following probabilities: 
(a) P(X < 1); (b) P(X > 1); and (c) P(0 < X < 3). 


Ans. (a) 4, (b 4, (c) à 
Consider the experiment of throwing two fair dice (Prob. 1.31). Let X be the r.v. indicating the sum of the 
numbers that appear. 
(a) What is the range of X? 
(b) Find (i) P(X = 3); (it) P(X < 4); and (iii) PG < X x 7). 
Ans. (а) Ry = {2,3,4,..., 12} 
(b) (+; Gai) 4; (ui) 5 
Let X denote the number of heads obtained in the flipping of a fair coin twice. 
(a) Find the pmf of X. 


(b) Compute the mean and the variance of X. 


Ans. (a) px(0) = 4, Px(1) = 4, px(2) = 4 
(b) Е(Х) = 1, Var(X) = $ 
Consider the discrete r.v. X that has the pmf 
Px(x,) = Gy* x, = 1, 2, 3,... 
Let A = (6: X(0 = 1, 3, 5, 7, ...). Find P(A). 


Ans. $ 
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2.59. 


2.60. 


2.61. 


2.62. 


2.63. 


2.64. 


2.65. 


2.66. 


Consider the function given by 


k 

= = 1, 2,3,... 
х=! C 

0 otherwise 


where k is a constant. Find the value of k such that р(х) can be the pmf of a discrete г.у. X. 
Ans k= 6/n? 
It is known that the floppy disks produced by company А will be defective with probability 0.01. The 


company sells the disks in packages of 10 and offers a guarantee of replacement that at most 1 of the 10 
disks is defective. Find the probability that a package purchased will have to be replaced. 


Ans. 0.004 


Given that X is a Poisson r.v. and p,(0) = 0.0498, compute E(X) and P(X > 3). 
Ans E(X) = 3, P(X > 3) = 0.5767 


A digital transmission system has an error probability of 107° per digit. Find the probability of three or 
more errors in 10° digits by using the Poisson distribution approximation. 


Ans. 0.08 


Show that the pmf p,(x) of a Poisson r.v, X with parameter А satisfies the following recursion formula: 


4 k 
px(k + 1) = kal Py(k) px(k — 1) = i px(k) 


Hint: Use Eq. (2.40). 


The continuous r.v. X has the pdf 


k(x — х?) О<х<1 
f = А otherwise 
where k is a constant. Find the value of k and the cdf of X. 
0 x<0 
Ans. k= 6; Ех) = 43x? - 2х) 0<х51 
1 х> | 


The continuous г.у. X has the pdf 


k(2x — x?) 0<х<2 
0 otherwise 


Ax) = { 


where k is a constant. Find the value of k and P(X > 1). 


Ans. k=3;P(X > 1) = 1 


A r.v. X is defined by the cdf 


0 x«0 
Ех) = {ух O<x<l 
k l<x 


(a) Find the value of k. 
(b) Find the type of X. 
(c) Find (i) PG < X < 1); (ii) PG < X < 1); and (iti) P(X > 2). 
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2.67. 


2.68. 


2.69. 


2.70. 


2.71. 


2.72. 
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Ans. (a) К=1. 
(b) Mixed r.v. 
(с) (i) 3; (ii) 4; Gii) O 
It is known that the time (in hours) between consecutive traffic accidents can be described by the exponen- 
tial r.v. X with parameter A = gg. Find (i) P(X < 60); (ii) P(X > 120); and (iii) P(10 < X < 100). 
Ans. (1) 0.632; (11) 0.135; (in) 0.658 
Binary data are transmitted over a noisy communication channel in block of 16 binary digits. The probabil- 


ity that a received digit is in error as a result of channel noise is 0.01. Assume that the errors occurring in 
various digit positions within a block are independent. 


(a) Find the mean and the variance of the number of errors per block. 
(b) Find the probability that the number of errors per block is greater than or equal to 4. 
Ans. (a) E(X) = 0.16, Var(X) = 0.158 

(b 0.165 x 1074 


Let the continuous r.v. X denote the weight (in pounds) of a package. The range of weight of packages is 
between 45 and 60 pounds. 


(a) Determine the probability that a package weighs more than 50 pounds. 
(b) Find the mean and the variance of the weight of packages. 

Hint: Assume that X is uniformly distributed over (45, 60). 

Ans. (a) $; (b E(X) = 52.5, Var(X) = 18.75 


In the manufacturing of computer memory chips, company A produces one defective chip for every nine 
good chips. Let X be time to failure (in months) of chips. It is known that X is an exponential r.v. with 
parameter 4 = + for a defective chip and 4 = 74 with a good chip. Find the probability that a chip pur- 
chased randomly will fail before (a) six months of use; and (Ь) one year of use. 


Ans. (a) 0.501; (b) 0.729 


The median of a continuous r.v. X is the value of x = хо such that P(X > x9) = Р(Х < xo). The mode of X 
is the value of x = x,, at which the pdf of X achieves its maximum value. 


(a) Find the median and mode of an exponential r.v. X with parameter 4. 
(b) Find the median and mode of a normal r.v. X = N(p, o?). 
Ans. (а) xg = (In 2)/A = 0.693/4, Xm = 0 
(b) Xo = Xm = н 
Let the r.v. X denote the number of defective components in a random sample of n components, chosen 


without replacement from a total of N components, r of which are defective. The r.v. X is known as the 
hypergeometric r.v. with parameters (N, г, п). 


(a) Find the pmf of X. 
(b) Find the mean and variance of X. 


Hint: To find E(X), note that 


To find Var(X), first find E[X(X — 1)]. 
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2.73. 


274. 


2.75. 


2.76. 


2.77. 


V 
ө ню) men o = 71 


A lot consisting of 100 fuses is inspected by the following procedure: Five fuses are selected randomly, and 
if all five “blow” at the specified amperage, the lot is accepted. Suppose that the lot contains 10 defective 
fuses. Find the probability of accepting the lot. 


Ans. (а) px(x) = 





Hint: Let X bea r.v. equal to the number of defective fuses in the sample of 5 and use the result of Prob. 
2.72. 


Ans. 0.584 
Consider the experiment of observing a sequence of Bernoulli trials until exactly r successes occur. Let the 


r.v. X denote the number of trials needed to observe the rth success. The r.v. X is known as the negative 
binomial r.v. with parameter p, where p is the probability of a success at each trial. 


(a) Find the pmf of X. 
(b) Find the mean and variance of X. 


Hint: To find E(X), use Maclaurin's series expansions of the negative binomial h(q) = (1 — q)~’ and its 
derivatives A'(g) and h"(q), and note that 


= k-1 e (х1 
hq) =(1-g) "= Y (+ )at = У, C уг 


To find Var(X), first find E[(X — Р(Х — r — 1)] using A’(q). 


Ans. (a) ni = (pa or^ x-rnrdal.. 


(b) EX)- (i Vax) = =? 
р р 





Suppose the probability that а bit transmitted through a digital communication channel and received in 
error is 0.1. Assuming that the transmissions are independent events, find the probability that the third 
error occurs at the lOth bit. 


Ans. 0.017 


А r.v. X is called a Laplace г.у. if its pdf is given by 
fx(x) = ke 4 A>0, —o«x«oco 
where k is a constant. 
(a) Find the value of k. 
(b) Find the cdf of X. 
(c) Find the mean and the variance of X. 
Ans. (a) К=А2 (b) Рх) = na xs 
I-e x20 
(c) E(X) = 0, Var(X) = 2/2? 


A r.v, X is called a Cauchy r.v. if its pdf is given by 
k 


a? + x? 





fx) = 


—oo«x«oo 
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where a (> 0) and k are constants. 


(a) Find the value of k. 
(b) Find the cdf of X. 
(c) Find the mean and the variance of X. 


1 |] -fX 
Ans. (a) К=а/л (b Ех) = 5 + z tan » 


(c) E(X) = 0, Var(X) does not exist. 


[CHAP 2 


Chapter 3 


Multiple Random Variables 


31 INTRODUCTION 


In many applications it is important to study two or more r.v.’s defined on the same sample 
space. In this chapter, we first consider the case of two r.v/s, their associated distribution, and some 
properties, such as independence of the r.v.'s. These concepts are then extended to the case of many 
r.v.'s defined on the same sample space. 


3.2. BIVARIATE RANDOM VARIABLES 


A. Definition: 


Let 5 be the sample space of a random experiment. Let X and Y be two r.v.’s. Then the pair (X, 
Y) is called a bivariate r.v. (or two-dimensional random vector) if each of X and Y associates a real 
number with every element of S. Thus, the bivariate r.v. (X, Y) can be considered as a function that to 
each point С in 5 assigns a point (х, y) in the plane (Fig. 3-1). The range space of the bivariate r.v. (X, 
Y) is denoted by R,, and defined by 


R,, = {(x, у); £C e S and Х({) = x, Ү() = y] 


If the r.v.s X and Y are each, by themselves, discrete r.v.'s, then (X, Y) is called a discrete 
bivariate r.v. Similarly, if X and Y are each, by themselves, continuous r.v.'s, then (X, Y) is called a 
continuous bivariate r.v. If one of X and Y is discrete while the other is continuous, then (X, Y) is 
called a mixed bivariate r.v. 





Fig. 3-1 (X, Y)as a function from S to the plane. 
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33 JOINT DISTRIBUTION FUNCTIONS 
А. Definition: 


The joint cumulative distribution function (or joint cdf) of X and Y, denoted by F ,,(x, у), is the 
function defined by 


Fyy(x, y) = Р(Х € x, Y x y) (3.1) 


The event (€ < x, Y < y) in Eq. (3.7) is equivalent to the event A ^ B, where A and B are events of 5 
defined by 


А= (065; (0) хх) and  B-(teS; YQ) « y) (3.2) 
and P(A) = Ех) P(B) = F,(y) 
Thus, Fy x, y) = P(A ^ B) (3.3) 


If, for particular values of x and y, A and B were independent events of S, then by Eq. (1.46), 
Fyy(x, у) = P(A ^ B) = Р(А)Р(В) = F x(x) Fy(y) 


B. Independent Random Variables: 
Two r.v.'s X and Y will be called independent if 
Fyy(x, у) = F(x) Fy(y) (3.4) 


for every value of x and у. 


C. Properties of Fyy(x, y): 


The joint cdf of two r.v.’s has many properties analogous to those of the cdf of a single r.v. 


1. OK< Fyy(x, y) <1 (3.5) 
2. Ifx, <x, and y, < y;, then 
Fyy(Xy, Yi) € Р(х. у) < Fryl%2, У) (3.6) 
Fyy(Xxi, у) < Fxy(X y, уз) € Fxy(x2, Y2) (3.6b) 
3. lim Fyy(x, y) = Руу(оо, со) = 1 (3.7) 
75 
4. lim Fyy(x, y) = Fyf—«, y) = 0 (3.8а) 
lim Fxy(x, y) = Fyy(x, — со) = 0 (3.85) 
yoo 
5, lim Fyy(x, y) = Руа“, y) = Fyy(a, y) (3.9a) 
lim Fyy(x, у) = Fyy(x, 67) = Fyy(x, b) (3.9b) 
yb* 
6. Р(х < X <x, Y < y) = Е, (х, y) — Ех, y) (3.10) 
P(X € х,у, < Y € yj) = Fry, y3) — Fxy(x, у) (3.11) 
7. Их, <x, and y, € уз, then 
Fyy(X2. уз) — Frys, уг) — Fxy(X3, yi)  Fxy(x,, у) 20 (3.12) 


Note that the left-hand side of Eq. (3.72) is equal to P(x, < X < x5, y, < Y < y;) (Prob. 3.5). 
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D. Marginal Distribution Functions: 


Now lim(X <x, Y x y)=(X <x, ¥ < 0) =(X <x) 


since the condition y x оо is always satisfied. Then 


lim Fyy(x, y) = Fyy(x, o0) = F (x) (3.13) 
yo 
Similarly, lim уух, y) = Fyy(o0, y) = Fy(y) (3.14) 


The cdf’s F(x) and F,(y), when obtained by Eqs. (3.13) and (3.14), are referred to as the marginal 
cdf's of X and Y, respectively. 


34 DISCRETE RANDOM VARIABLES—JOINT PROBABILITY MASS FUNCTIONS 
A. Joint Probability Mass Functions: 


Let (X, Y) be a discrete bivariate r.v., and let (X, Y) take on the values (xi, yj for a certain 
allowable set of integers i and j. Let 


Pxy(xi, y) = P(X = xi, Y = y) (3.15) 
The function py,(x;, yj) is called the joint probability mass function (joint pmf) of (X, Y). 


B. Properties of pxy(x,, y): 


1. 0<руу(х;, у) € 1 (3.16) 
2. YY Pu у) = 1 (3.17) 
Xi yj 
3 P[X,Y)eA]- УУ рух, у) (3.18) 
(хь уђе Ra 


where the summation is over the points (x,, уу) in the range space R, corresponding to the event А. 
The joint cdf of a discrete bivariate r.v. (X, Y) is given by 


Fyy(X, у) = Y Y Pxy (Xi yj (3.19) 


хі<х yjSy 


C. Marginal] Probability Mass Functions: 


Suppose that for a fixed value X = x;, the r.v. Y can take on only the possible values y; (j = 1, 2, 
...,n) Then 


P(X = x) = ру(х) = уз Руү\Х, Уу) (3.20) 


where the summation is taken over all possible pairs (x;, y;) with x; fixed. Similarly, 


P(Y = y) = pj) = Y, Рууд, УУ (3.21) 


where the summation is taken over all possible pairs (х;, у) with y; fixed. The pmf's ру(х,) and p,(y,), 
when obtained by Eqs. (3.20) and (3.21), are referred to as the marginal pmf's of X and Y, respectively. 


D. Independent Random Variables: 
If X and Y arc independent r.v.'s, then (Prob. 3.10) 


PxyY (Xi, yj) = Px(<i)Py(y)) (3.22) 


82 MULTIPLE RANDOM VARIABLES [CHAP 3 


35 CONTINUOUS RANDOM VARIABLES—JOINT PROBABILITY DENSITY 2 
FUNCTIONS 


A. Joint Probability Density Functions: 
Let (X, Y) be a continuous bivariate r.v. with cdf Fyy(x, y) and let 


0? F yy(x, y) 
Ox ду 


The function fy,(x, y) is called the joint probability density function (joint pdf) of (X, Y) By 
integrating Eq. (3.23), we have 


Лу у) = (3.23) 


х У 
Fyy(x, у) = | | SxS, n) dn dg (3.24) 
B. Properties of fy,(x, y): 
L. fyylx, y) 20 (3.25) 
2 | | fxy(x; y) dx dy = 1 (3.26) 
3. fyy(x, y) is continuous for all values of x or y except possibly a finite set. 
4 P[(X, Ү)є А = [|| Sxyl%, y) dx dy (3.27) 
Ra 
4 b 
5. СЕЗЕ) [foots 9 ax ay (3.28) 


Since Р(Х = а) = 0 = P(Y = с) [by Eq. (2.19)], it follows that 
Pla<X<be<Y<d=Pia<X<bce<Yead=PlacX<bece<Y<d 


4 fb 
= Pa<X<be<¥<d=| [лао (3.29) 


С. Marginal Probability Density Functions: 
By Eq. (3.13), 


Fy(x) = Fyy(x, о) = | [ Ју, n) dn dg 


dF e 
Шеше ies — = | fede ta 
x -œ 
or fx(x) = | /уу(%, y) dy (3.30) 
Similarly, fi» = | Says y) dx (3.31) 


The pdf's f,(x) and fy(y), when obtained by Eqs. (3.30) and (3.31), are referred to as the marginal pdf's 
of X and Y, respectively. 
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D. Independent Random Variables: 
If X and Y are independent r.v.’s, by Eq. (3.4), 


Fyy(x, y) = Fy(x)Fy(y) 


9? Fyy(x, y) E 9 ô 
Then 95 ду Ox Р(х) ду (у) 
or Луб, у) = fa) Му) (3.32) 


analogous with Eq. (3.22) for the discrete case. Thus, we say that the continuous r.v.’s X and Y are 
independent r.v.’s if and only if Eq. (3.32) is satisfied. 


36 CONDITIONAL DISTRIBUTIONS 
A. Conditional Probability Mass Functions: 
If (X, Y) is a discrete bivariate r.v. with joint pmf pyy(x;, yj), then the conditional pmf of Y, given 
that X — x;, is defined by 
Xis ) 
prola = PRO рух) > 0 (3.33) 
Px(X,) 


Similarly, we can define руу(х;| yj) as 


р Xi ? ј 
рух] у) = PW рду) >0 (3.34) 
рү(у)) 
B. Properties of ру | x(y, | xj): 
l. OS pyx(yjIx) € 1 (3.35) 
2. Y pnxGlxd-m 1 (3.36) 
Уј 


Notice that if X and Y are independent, then by Eq. (3.22), 
PyixGjl x) = Py(y;) and Pxy (Xil yj = px(x) (3.37) 


C. Conditional Probability Density Functions: 


If (X, Y) is a continuous bvivariate r.v. with joint pdf /уу(х, y), then the conditional pdf of Y, given 
that X = x, is defined by 


foi = 200 sso (3.38) 
Similarly, we can define fy,y(x | y) as 
faci = 2050 fay) > 0 (3.39) 


D. Properties of fy | x(y |x): 
1. fixy) 20 (3.40) 


2. [ Srxv |x) dy = 1 (3.41) 


84 MULTIPLE RANDOM VARIABLES [CHAP 3 


As in the discrete case, if X and Y are independent, then by Eq. (3.32), 
Fyix( Ix) = fyly) and Лхуу(Х1 у) = fx(x) (3.42) 


37 COVARIANCE AND CORRELATION COEFFICIENT 
The (k, n)th moment of a bivariate r.v. (X, Y) is defined by 


У Y xi y pxy(Xi yj) (discrete case) 
m, = E(X*Y") = 45," e. (3.43) 
хку"/уу(х, y) dx dy (continuous case) 


If n = 0, we obtain the kth moment of X, and if k = 0, we obtain the nth moment of Y. Thus, 

mio = E(X) = ux and mo, — E(Y) 2 uy (3.44) 
If (X, Y) is a discrete bivariate r.v., then using Eqs. (3.43), (3.20), and (3.21), we obtain 

их = E(X) = Y Y Xi Pxy(Xi, yj) 


yj xi 
=}, Ху Рхү(х;› »| = Y xipx(x) (3.45a) 
xi yj хі 
By = Е(Ү) = Y У у;рху(х;, Y) 
Xi у) 


=) [x Pxy (Xi, »| =} yp) (3.45b) 
У} х yj 


Similarly, we have 
E(X?) = У Yx?pxX; y) = У. x! py(xy) (3.46a) 


yj хі 


E(Y?) = Y Y y? py. y) = x уруу) (3.46) 


yy хі 


If (X, Y) is a continuous bivariate r.v., then using Eqs. (3.43), (330) and (3.31), we obtain 


Hy = E(X) = f [ Xfxy(x, y) dx dy 


= u AT. i Sxy(x, y) a| dx = F xfx(x) dx (3.47a) 
шү = E(Y) = E [` Wxylx, y) dx dy 
= f ||. Лбх, y) | dy = i Yf) dy (3.47b) 
Similarly, we have 
E(X?) = F [. X!foy(x, y) dx dy = F хрх) dx (3.48a) 


Е(Ү?) = [` E ух, у) dx dy = [` угуу) dy (3.48b) 
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The variances of X and Y are easily obtained by using Eq. (2.31). The (1, 1)th joint moment of (X, Y), 
mi, = E(XY) (3.49) 
is called the correlation of X and Y. If E(X Y) = 0, then we say that X and Y are orthogonal. The 
covariance of X and Y, denoted by Cov(X, Y) or oxy, is defined by 
Cov(X, Y) = oxy = E[(X — ХҮ — py)] (3.50) 
Expanding Eq. (3.50), we obtain 
Cov(X, Y) = E(XY) — E(X)E(Y) (3.51) 
If Cov(X, Y) = 0, then we say that X and Y are uncorrelated. From Eq. (3.51), we see that X and Y 
are uncorrelated if 
E(XY) = E(X)E(Y) (3.52) 
Note that if X and Y are independent, then it can be shown that they are uncorrelated (Prob. 
3.32), but the converse is not true in general; that is, the fact that X and Y are uncorrelated does not, 


in general, imply that they are independent (Probs. 3.33, 3.34, and 3.38). The correlation coefficient, 
denoted by p(X, Y) or руу, is defined by 


CowX,Y) _ oxy 


A(X, Y) = руу = 
Ox Oy Ox Oy 


(3.53) 


It can be shown that (Prob. 3.36) 
[рху| El or —1< Pxy < 1 (3.54) 


Note that the correlation coefficient of X and Y is a measure of linear dependence between X and Y 
(see Prob. 4.40). 


3.8 CONDITIONAL MEANS AND CONDITIONAL VARIANCES 


If (X, Y) is a discrete bivariate r.v. with joint pmf pyy(x;, уу), then the conditional mean (or condi- 
tional expectation) of Y, given that X = х;, is defined by 


Byla = E(Y |x) = Y Y; Pix Qj] x) (3.55) 
yj 


The conditional variance of Y, given that X = x,, is defined by 
сүр, = Var(Y |х) = E[(Y — uy? 1xi] = È (Уу Brid Prix Gi; l2 G.56) 
ИЛ 


which сап be reduced to 

Var(Y | x) = E(Y?|x) — [E(Y | x)? (3.57) 
The conditional mean of X, given that Y — y;, and the conditional variance of X, given that Y — yp 
are given by similar expressions. Note that the conditional mean of Y, given that X — x;, is a func- 
tion of x; alone. Similarly, the conditional mean of X, given that Y = y;, is a function of y; alone. 


If (X, Y) is a continuous bivariate r.v. with joint pdf fyy(x, y), the conditional mean of Y, given 
that X — x, is defined by 


By = E(Y |x) = [` УЈх(уіх) dy G.58) 


The conditional variance of Y, given that X — x, is defined by 


oo 


оу, = Var(Y |x) = E[(Y — n^ |x] = f 0- yy) frix |x) dy (3.59) 
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which can be reduced to 
Var(Y |x) = E(Y?|x) — LE(Y | х)]? (3.60) 


The conditional mean of X, given that Y = y, and the conditional variance of X, given that Y = y, 
are given by similar expressions. Note that the conditional mean of Y, given that X — x, is a function 
of x alone. Similarly, the conditional mean of X, given that Y — y, is a function of y alone (Prob. 
3.40). 


39 N-VARIATE RANDOM VARIABLES 


In previous sections, the extension from one r.v. to two r.v.'s has been made. The concepts can be 
extended easily to any number of r.v.’s defined on the same sample space. In this section we briefly 
describe some of the extensions. 


A. Definitions: 


Given an experiment, the n-tuple of r.v.’s (Xi, X5, ..., X,) is called an n-variate r.v. (or n- 
dimensional random vector) if each X;, i= 1, 2, ..., n, associates a real number with every sample 
point £ e S. Thus, an n-variate r.v. is simply a rule associating an n-tuple of real numbers with every 
tes. 


Let (X,,..., X,) be an n-variate r.v. on S. Then its joint cdf is defined as 
Frys хх х) = P(X i S xy. X, E xj) (3.61) 
Note that 
Fy... x,(,..-, ©) = 1 (3.62) 


The marginal joint cdf's are obtained by setting the appropriate X;'s to +o in Eq. (3.61). For 
example, 


Ех, xg (Xa ++, Xn) = Fx x ax (Xp o Xp 00) (3.63) 
Ех, X1) = Fxixaxy x Xi X2» ©©,..., 00) (3.64) 

A discrete n-variate r.v. will be described by a joint pmf defined by 
рх, хх: Xn) = Р(Х, = xy, 000, Х„ = x) (3.65) 


The probability of any n-dimensional event A is found by summing Eq. (3.65) over the points in the 
n-dimensional range space R, corresponding to the event A: 


PU(X,,...,X) € Al = У-У Prex o Xn) (3.66) 


(xi, s Ха) є Ra 


Properties of px, ... x (x1, <+- Xn): 
l. OS py, x (X, o x) l (3.67) 
2. Y: Xp ee x (My s Xl (3.68) 
X1 Xn 


The marginal pmf's of one or more of the r.v.’s are obtained by summing Eq. (3.65) appropriately. 
For example, 


Prixa 10х05 o 1) m У руу... х(Ху›...› X) (3.69) 


Py) = Y i Y Pa s X) (3.70) 
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Conditional pmf's are defined similarly. For example, 


Рх, xi: tt Xn) 


Pxyxi хи (Ха Хә +++, Ха) = 
nlX1 n-1V^n Н п 1 PX х, o Хаа) 


A continuous n-variate г.у. will be described by a joint pdf defined by 


- OT Fy, e (Xy o X) 





fuia sax o s Xa) 


OX, +++ Ox, 
Then Fy,... хх, tto. x,) = F UU K Sx, xs өө» e) dé, "Utt dé, 
and P[(X,,..., X,) € А] = fof fx sx En ves 6) dy 4, 
(Xp, 0, Xa) € RA 


Properties of fx, ... x (x1, +65 Xa): 
хоо х) > 0 


2. | -f Sri xg Xs css Xn) @ху х, = 1 
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(3.71) 


(3.72) 


(3.73) 


(3.74) 


(3.75) 


(3.76) 


The marginal pdf's of one or more of the r.v.’s are obtained by integrating Eq. (3.72) appropriately. 


For example, 


© 
fx, Xa X ttt Xn-1) = | Sx, 61036 эс» Xn) dx, 
-œ 


Ља) = f UC f fx xo ry xj) dx; tt dx, 


Conditional pdf’s are defined similarly. For example, 


fx x X tt Xn) 
fx osx tty X, - 1) 


The r.v.'s X, ..., X, are said to be mutually independent if 


Sanxi Xa Xi eeo Хы) = 


руу xs Xn) = Приб) 


for the discrete case, апа 


Sx, Xp ttt Xn) = П Лх) 


for the continuous case. 
The mean (or expectation) of X, in (X,,..., X,) is defined as 


узб У хрх, xp so X) (discrete case) 


щ= ЕХ) = уча " oc 
| nee | Xi fxs xp +, X) dx, + dx (continuous case) 


n 
= ос — 


The variance of X; is defined as 
ог = Var(X) = ЕХ, — n)?] 


(3.77) 


(3.78) 


(3.79) 


(3.80) 


(3.81) 


(3.82) 


(3.83) 
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The covariance of X, and X; is defined as 


9; = Cov(X;, X) = ERX, — ААХ; — n3] (3.84) 
The correlation coefficient of X; and X; is defined as 
C o X; И 
„= Yn L в. (3.85) 
0,0; 0,0; 


310 SPECIAL DISTRIBUTIONS 
A. Multinomial Distribution: 


The multinomial distribution is an extension of the binomial distribution. An experiment is 


termed a multinomial trial with parameters p,, p2,..., py, if it has the following conditions: 
1. The experiment has k possible outcomes that are mutually exclusive and exhaustive, say A,, A5, 
wavy Ag. 
k 
2. P(A) = pi i=1,...,k and Y p=! (3.86) 
i=l 


Consider an experiment which consists of n repeated, independent, multinomial trials with param- 
eters p,, P2,..., py. Let X; be the r.v. denoting the number of trials which result in A;. Then (X,, X2, 
..., Xj) is called the multinomial r.v. with parameters (и, p,, P2, ..., p,) and its pmf is given by (Prob. 
3.46) 


рхух› tee xv Xz ‚+з ху) = a er p, p? Ut рк“ (3.87) 
1 . Ге 


k 
for xj; =0,1,...,2,i=1,...,k, such that Y x; =n. 
i=] 


Note that when k = 2, the multinomial distribution reduces to the binomial distribution. 


B. Bivariate Normal Distribution: 
A bivariate r.v. (X, Y) is said to be a bivariate normal (or gaussian) r.v. if its joint pdf is given by 


1 | 
Љу, у) = лоо py exp[ — 24(х, y)] G.88) 


— 2 _ _ _ 2 
where q(x, у) = i — (е) — (Eee) + (=) | (3.89) 


and Hy, Hy, Cx’, су? are the means and variances of X and Y, respectively. It can be shown that p is 
the correlation coefficient of X and Y (Prob. 3.50) and that X and Y are independent when р = 0 
(Prob. 3.49). 





C. N-variate Normal Distribution: 


Let (X,, ..., X,) be an n-variate r.v. defined on a sample space S. Let X be an n-dimensional 
random vector expressed as an n x 1 matrix: 


x=|: (3.90) 
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Let x be an n-dimensional vector (n x 1 matrix) defined by 


X; 
x=]: (3.91) 
Xn 
The n-variate r.v. (X ,, ..., X,) is called an n-variate normal г.у. if its joint pdf is given by 
l 1 Ty -1 
fx) = (ny? |det К]? gl - a(x — p) K` (x—m (3.92) 
where T denotes the “transpose,” p is the vector mean, K is the covariance matrix given by 
Hi E(X.) 
=E X =|: |=| : 
p = E[X] : : (3.93) 
Hs E(X,) 
Fy, 77 Cin 
К=|. ^^. : с; = Cov(X;, Xj) (3.94) 
951 Ut Onn 
and det К is the determinant of the matrix K. Note that fy(x) stands for fy, ... x (xi, ..., x,). 
Solved Problems 


BIVARIATE RANDOM VARIABLES AND JOINT DISTRIBUTION FUNCTIONS 


3.1. Consider an experiment of tossing a fair coin twice. Let (X, Y) be a bivariate r.v., where X is the 
number of heads that occurs in the two tosses and Y is the number of tails that occurs in the two 


tosses. 

(a) What is the range Ry of X? 

(b) What is the range Ry of Y? 

(c) Find and sketch the range Ry y of (X, Y). 

(d) Find P(X = 2, Y = 0), Р(Х = 0, Y = 2), and P(X = 1, Y = 1) 


The sample space S of the experiment is 
S = (HH, HT, TH, TT} 

Rx = {0,1,2} 

Ry = {0, 1, 2} 

Куу = {(2, 0), (1, 1), (0, 2)) which is sketched in Fig. 3-2. 

Since the coin is fair, we have 
P(X = 2, Y =0)= P(HH) =} 
P(X =0, Y =2)=P{TT} -i 
P(X =1, Y = 1) = P{HT, TH} -i 


3.2. Consider a bivariate r.v. (X, Y). Find the region of the xy plane corresponding to the events 


A={X+Y <2} В = (X? + Y? <4} 
C = {min(X, Y) x 2} D = (max(X, Y) x 2} 
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3.3. 


3.4. 


3.5. 


The region corresponding to event A is expressed by x + y < 2, which is shown in Fig. 3-3(a), that is, 
the region below and including the straight line x + y = 2. 

The region corresponding to event B is expressed by x? + y? < 2?, which is shown in Fig. 3-3(b), that 
is, the region within the circle with its center at the origin and radius 2. 

The region corresponding to event C is shown in Fig. 3-3(c), which is found by noting that 


(min(X, Y) 2) = (X x2) o (Y x 2) 
The region corresponding to event D is shown in Fig. 3-3(0), which is found by noting that 


(max(X, Y) <2} = (X € 2) n (Y <2) 


Verify Eqs. (3.7), (3.8a), and (3.85). 
Since (X < oo, Y < со} = $ and by Eq. (1.22), 
P(X < oo, Y € oc) = Ёуү(со, оо) = P(S) 21 
Next, as we know, from Eq. (2.8), 


Р(Х = – о) = Р(Ү = – оо) = 0 








Since (X2—o,Yzy)cí(X-2-—2o) and (Xxx Yx-—o)c(Y»- ~ о) 


and by Eq. (1.27), we have 
P(X = —o, Y < y) = Fe- œ, у) = 0 


Р(Х <x, Y = —0) = Fyyx, — оо) = 0 


Verify Eqs. (3.10) and (3.11). 
Clearly (Xzx,Yzxy-(Xzx,Yzywuo(x«Xzx,Yzy) 





The two events on the right-hand side are disjoint; hence by Eq. (1.23), 
P(X € x,, Y X y) = Р(Х € x, Y € y) à Pix, « X € x,, Y X y) 
or P(x, < X < x, Y <y)=P(X <x, VY Sy) -P(X Sx, Y<y) 
= Куү(х›, y) — Ёуү(х\, у) 
Similarly, 
(XzxYzy)e9(XzxYsy)o(Xsxy,«Yxy) 
Again the two events on the right-hand side are disjoint, hence 
P(X <x, Ү<у,) = P(X <x, Ү<у) + Р(Х <х, у < Y¥ < yo) 
or P(X <x, у < Y<y) = РКХ <x, ¥ Sy.) -—-P(X <x, Y<y) 
= Fyy(x, уз) — Frye, yi) 


Verify Eq. (3.12). 
Clearly 
(x, < X $x, Ү< у) =(X,< XS x.,¥ Sy) о(х <X <х,,у,<Ү<у,) 
The two events on the right-hand side are disjoint; hence 
P(x, < X SX, Y S yo) = Р(х < X Sx, Y<y) + Р(х < Х<х,,у < Y < y) 
Then using Eq. (3.10), we obtain 


Р(х < X < x2, 9, < Y < ya) = Р(х < X < x3, Y < y3) – Р(х < X < x2, Y Sy,) 
= Ех, y3) — Ёуүү(х\, У) — [Ех yi) — Fxy(x y, У) 
= Fryy(%25 у) — Ех Уз) — Ёхү(Х;, у) + Ёуү(хү, у,) (3.95) 
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3.7. 
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Since the probability must be nonnegative, we conclude that 
Еуү(х›, Ya) — FxylXi Уу) — FyyGxa, у) + Ёуү(ху, уу) 20 


if x, > x, and y; 2 y,. 


Consider a function 
1—e 0*» Qzx«o,0xy«o 
0 otherwise 


F(x, »-| 


Can this function be a joint cdf of a bivariate r.v. (X, Y)? 


[CHAP 3 


It is clear that F(x, y) satisfies properties 1 to 5 of a cdf [Eqs. (3.5) to (3.9)]. But substituting F(x, y) in 


Eq. (3.12) and setting х, = y; = 2 and x, = y, = 1, we get 


F(2, 2) — Е(1, 2) – F(2, 1) + FU, D2(1—e*-(1—e9)—(1—e7)(1—e7?) 


= е7 + 2673 672 = – (072 е!) < 0 


Thus, property 7 [Eq. (3.12)] is not satisfied. Hence F(x, y) cannot be a joint cdf. 


Consider a bivariate r.v. (X, Y). Show that if X and Y are independent, then every event of the 


form (a < X < Б) is independent of every event of the form (c < Y < d). 
By definition (3.4), if X and Y are independent, we have 
F yy(x, y) = Fy(x)F y(y) 
Setting x, = a,x, = b, y, = c, and y, = d in Eq. (3.95) (Prob. 3.5), we obtain 
P(a < X X b, c < Y € d) = Fyy(b, d) — Fyy(a, d) — Fyy(b, c) + Руа, c) 

= Fyx(b)Fy(d) — Fy(a)P yd) — Fx(b)F fc) + Fyla)F io) 
= [Fx(b) — F x(a) LF yd) — Еү(с)] 
= Pía« X x b)P(c « Y x d) 

which indicates that event (a < X < b) and event (c < Y < d) are independent [Eq. (1.46)]. 


The joint cdf of a bivariate r.v. (X, Y)is given by 
(1—e7*X1—e?) х> 0, у> 0, а, > 0 
Fy fx, у) = . 
xis, y) t otherwise 

(a) Find the marginal cdf's of X and Y. 
(b) Show that X and Y are independent. 
(c) Find P(X < 1, Y < I), P(X x 1), P(Y > t), and P(X > x, Y > y) 
(а) By Eqs. (3.13) and (3.14), the marginal cdf’s of X and Y are 
]—e'** x20 
0 х< 0 
1—-e? у>0 
0 у<0 
(b) Since Fy,(x, y) = Fx(x)Fy(y), X and Y are independent. 
(б) PXxLY <1)= Е, 1,10) = (1-е е2) 

P(X < 1) = РД) =(1-е7°) 

P(Y > 1) =1- Р(Ү<1) = 1 — Е) = e`? 


Fx(x) = Fyy(x, co) = | 


Fy(y) = Fxy(, y) = i 


CHAP. 3] MULTIPLE RANDOM VARIABLES 93 


By De Morgan’s law (1.15), we have 
X> 0) (у> у) = (Хх) о Y> у) = (Хх) о (уху) 
Then by Eq. (1.29), 
PIX »x)o^(Y»y)-PXx»PYzy-PXzxYxy 
= Fy(x) + Еу) — Ехубх, y) 
=(l-e ™)+(1-e°”) -(1 —e7™X1 — e7”) 
2|-e Ug» 
Finally, by Eq. (1.25), we obtain 
PX»xY»y-I-P(X»3n(Y»y]-e' "e 


39. Тһе joint cdf of a bivariate r.v. (X, Y) is given by 
0 х<0 or у<0 
Py О<х<а O<y<b 
Fyyx,y)— ўр xza, Os y<b 
Рз О<х<а, y>b 
1 х>а, yzb 


(а) Find the marginal cdf’s of X ава Y. 
(b) Find the conditions on p,, p2, and p, for which X and Y are independent. 


(a) By Eq. (3.13), the marginal cdf of X is given by 


0 x<0 
F(x) = Fyylx, ©) = {р  Oxx«a 
1 х>а 
By Eq. (3.14), the marginal cdf of Y is given by 
0 у<0 
Еу) = Fy. у) = 4р O<y<b 
1 у> Б 


(b) For X and Y to be independent, by Eq. (3.4), we must have Р(х, y) = Fy(x)F,(y). Thus, for 0 < x < a, 
0 € y < b, we must have p, = pz; p; for X and Y to be independent. 


DISCRETE BIVARIATE RANDOM VARIABLES—JOINT PROBABILITY MASS 
FUNCTIONS 


3.10. Verify Eq. (3.22). 
If X and Y are independent, then by Eq. (1.46), 
рхү(х;, y = P(X = x, Y = yp = Р(Х = x)P(Y = у) = px(x)py(yj 


3.11. Two fair dice are thrown. Consider a bivariate r.v. (X, Y). Let X = 0 or 1 according to whether 
the first die shows an even number or an odd number of dots. Similarly, let Y — 0 or 1 according 


to the second die. 
(a) Find the range Куу of (X, Y). 
(b) Find the joint pmf's of (X, Y). 
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(a) The range of (X, Y) is 


Ryy = {(0, 0), (0, 1), (1, 0), (1, 1)) 


(b) Itis clear that X and Y are independent and 


Thus 


Р(Х = 0) = Р(Х = 1) = = { 
P(Y = 0) = Р(Ү = 1) = 2 


Рх, ђ = P(X =i, Y =j = Р(Х = )Р(Ү=ђ=4ф іј= 0,1 


3.12. Consider the binary communication channel shown in Fig. 3-4 (Prob. 1.52). Let (Х, Y) be a 
bivariate r.v., where X is the input to the channel and Y is the output of the channel. Let 


P(X 
(a) 
(b) 
(c) 
(a) 


(b) 


= 0) = 0.5, P(Y = 1| X = 0) = 0.1, and (Y = 0|Х = 1) = 0.2. 
Find the joint pmf’s of (X, Y). 

Find the marginal pmf’s of X and Y. 

Are X and Y independent? 

From the results of Prob. 1.52, we found that 


P(X =1)=1— P(X =0)=05 
Р(Ү =0|Х = 0) = 0.9 P(Y = 1|Х = 1) = 08 


Then by Eq. (1.41), we obtain 
P(X 20, Y = 0) = P(Y =0|Х =0)P(X = 0) = 0.9(0.5) = 0.45 
P(X =0, Y = 1) = P(Y = 1|Х =0)P(X = 0) = 0.10.5) = 0.05 
P(X =1, Y =0)= P(Y =0|Х = Р(Х = 1) = 0.2(0.5) = 0.1 
Р(Х =1, Y = 1) = P(Y =1|Х =1)P(X = 1) = 0.8(0.5) = 04 
Hence, the joint pmf’s of (X, Y) are 
Pxy(0, 0) = 0.45 рхү(0, 1) = 0.05 
pxy(1, 0) = 0.1 рхү(1, 1) = 0.4 
By Eq. (3.20), the marginal pmf’s of X are 
px(0) = У pyy(0, y) = 0.45 + 0.05 = 0.5 
» 


px(l) = Y pxy(l, y) = 0.1 +04 = 0.5 


yy 

By Eq. (3.21), the marginal pmf's of Y are 
рү(0) = Y. pyy(x;, 0) = 0.45 + 0.1 = 0.55 
рү(1) = У, pyy(x;, 1) = 0.05 + 0.4 = 0.45 


Xi 


P(Y 201X = 0) 





Р(Ү=- ИХ = |) 


Fig. 3-4 Binary communication channel. 
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(c) Now 
рх(0)ру(0) = 0.5(0.55) = 0.275 5 pxy(0, 0) = 0.45 


Hence X and Ү are not independent. 


3.13. Consider an experiment of drawing randomly three balls from an urn containing two red, three 
white, and four blue balls. Let (X, Y) be a bivariate r.v. where X and Y denote, respectively, the 
number of red and white balls chosen. 


(a) Find the range of (X, Y). 

(b Find the joint pmf's of (X, Y). 

(c) Find the marginal pmf's of X and Y. 
(d) Are X and Y independent? 


(a) The range of (X, Y) is given by 
Куу = {(0, 9), (0, 1), (0, 2), (0, 3), (1, 0), (1, 1), (1, 2), (2, 0), (2, D) 
(b) The joint pmf's of (X, Y) 
Pxyli, J) = P(X =i, Y =/) i=0,1,2 j=0,1,2,3 


are given as follows: 


rane OO- 
woa- (A-E из QUO) 

miro) OOOO- 
моа C-E 

weoi а-ә 


which are expressed in tabular form as in Table 3.1. 


(c) The marginal pmf’s of X are obtained from Table 3.1 by computing the row sums, and the marginal 
pmf's of Y are obtained by computing the column sums. Thus 


px(0) = # px(1) = 52 рх(2) = 3s 
py(0) = ag рү(1) = & рү(2) = Lr Py(3) = ңа 


Table 3.1 py, j) 
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3.15. 
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(d) Since 
Pxy(0, 0) = ga # px(O)py(0) = 34 (82) 


X and Y are not independent. 


The joint pmf of a bivariate r.v. (X, Y) is given by 
k(2x; + yj) x,=1,2;y,=1,2 
Pxy(Xi, У) = to / ; 


otherwise 


where k is a constant. 
(a) Find the value of k. 
(b) Find the marginal pmf’s of X and Y. 
(c) Are X and Y independent? 
(a) By Eq. (3.17), 
2 2 
У У Pxy(Xi. у) = У У k(2x; + yj 


xi у) х= yj=1 
= k[(2 + D-2 -2 -(4- 1) + (4 + 2)] = k(18)= 1 
Thus, k = 45. 
(b) By Eq. (3.20), the marginal pmf's of X are 
2 
Px(X;) = Y Рҳү(х;, yj = У ix, + yj 
yy y-t 
= (2x; + 1) + (2x; + 2) = (4х; + 3) х= 1, 2 


By Eq. (3.21), the marginal pmf’s of Y are 


2 
р) = È Рху(Хх› y) = У, т8(2х + у)) 
Xi х=! 
= (2 + y) + (4 + y) = (2y; + 6) уу= 1,2 
(с) Now px(x)Py(yj) # Рху(х;. у); hence X and Y are not independent. 


The joint pmf of a bivariate r.v. (X, Y) is given by 
кх?у, x, = 1,2; y,= 1, 2,3 

Paesi, y) = f otherwise 
where k is a constant. 
(a) Find the value of k. 
(b) Find the marginal pmf's of X and Y. 
(c) Are X and Y independent? 
(a) By Eq. (3.17), 

2 3 
Y Y Pxyl(%is y) = Y Y kx)?y, 


xi yj xi= l уу=1 


=k1+24+34+4484 12) = k30) = 1 


Thus, К = 45. 
(b) By Eq. (3.20), the marginal pmf’s of X are 
3 
px(x) = Y Pxy(Xi, у) = Y TX); = ix x= 1,2 


»j yal 
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By Eq. (3.21), the marginal pmf's of Y are 
2 
Pyly;) = У Pyylx;, yj = У 3oxi yj = iy; у; = 1, 2, 3 
Xj х= 1 
(с) Now 
Px(x))Py(y;) = 30X Y) = pxy(xi, yj 


Hence X and Y are independent. 


3.16. Consider an experiment of tossing two coins three times. Coin A is fair, but coin B is not fair, 
with P(H) = 1 and P(T) = 2. Consider a bivariate r.v. (X, Y), where X denotes the number of 
heads resulting from coin A and Y denotes the number of heads resulting from coin B. 
(a) Find the range of (X, Y). 
(b) Find the joint pmf's of (X, Y). 
(c) Find P(X = Y), P(X > Y), and P(X + Y < 4). 
) The range of (X, Y) is given by 
Куу = {(i j): i j = 0, 1, 2, 3} 


(Б) It is clear that the r.v.’s X and Y are independent, and they are both binomial r.v.'s with parameters (л, 
р} = (3, 3) and (n, р) = (3, 4), respectively. Thus, by Eq. (2.36), we have 


3MVIV 1 3\f1\3 
nosar] иемин (XQ - 


1 3 

8 8 
мамке (06) =3 000 sem) -; 
Py(0) = P(Y = 0) = ӨРӨ -2 py(l) = P(Y = 0 = CX) à E я 
warra- (E-A rora (OE -A 


Since X and Y are independent, the joint pmf's of (X, Y) are 
Pxvli, J) = Рур) ј= 0, 1 2,3 


which are tabulated in Table 3.2. 
(c) From Table 3.2, we have 


3 
P(X = Ү) = Y pui. ) = 98507 + 814+ 27+ D = 126 
i20 


3 
AX > Y) = Уруу) = Pyy(l, 0) + P440, 0) + P4,(. 0) + pyy(2, 1) + руу(3, 1) + руу(3, 2) 


i>} 


= 3h (81 +81 + 27 + 81 +27+9 = 39$ 


Table 3.2 pyy(i, Л 
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P(X +Y>4)= У ре, j) = Руу(2, 3) + Pyy(3, 2) + Pxy(3, 3) 


itj> 
= gh 9 1) = 45 
499 


Thus, Р(Х +у<4)=1—Р(Х+у>»4)=1—3Д% = 8 


CONTINUOUS BIVARIATE RANDOM VARIABLES—PROBABILITY DENSITY 
FUNCTIONS 


3.17. The joint pdf of a bivariate r.v. (X, Y) is given by 
fod у) = k(x + y) 0<x<2,0<y<2 
xe Y= 19 otherwise 
where k is a constant. 


(а) Find the value of k. 
(b) Find the marginal pdf’s of X and Y. 
(c) Are X and Y independent? 


(a) By Eq. (3.26), 
» pe 2 {2 
| | Ју y) dx a= [ (x + y) dx dy 


24g 
=k (5+) 
LG 


2 
ETETE 
0 


x-2 
dy 


x=0 





Thus k = 4. 
(b) By Eq. (3.30), the marginal pdf of X is 


ос l 2 
hx) = f Ју y) dy = 8 | (х + у) ду 


1 у? 
EE >) 


Since fyy(x, y) is symmetric with respect to x and y, the marginal pdf of Y is 


0 otherwise 





ce fer 0«x«2 


yz0 


1 
_ Jay t D 0<у<2 
fi» = RN otherwise 


(c) Since fyy(x, у) £ Лх) fy), X and Y are not independent. 


3.18. The joint pdf of a bivariate r.v. (X, Y) is given by 
Far y) = nd odere CM 
where k is a constant. 
(a) Find the value of k. 
(b) Are X and Y independent? 
(c) Find P(X +Y < 1). 
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у у 
1 1 > 
> 
0 1 0 t 


(a) (b) 
Fig. 3-5 
(а) The range space Ryy is shown in Fig. 3-5(a). By Eq. (3.26), 


x m 1 | i in 
| | уух, y) dx dy = | Е dx a= « (Do 
CUT MN » V2 o 


Thus k = 4. 
(b) To determine whether X and Y are independent, we must find the marginal pdf's of X and Y. By Eq. 
(3.30), 
1 
4ху dy = 2x O<x<l1 
fle) = [ 
0 otherwise 
By symmetry, 
_ ]2у O<y<l 
hin = f otherwise 


Since fyy(x, у) = fx) AW), X and Y are independent. 
(c) The region in the xy plane corresponding to the event (X + Y < 1) is shown in Fig. 3-5(Ь) as a shaded 


area. Then 
t 1-y 1 х2 1—y 
px e rens [ | by axdy = [ (S Jo 
о Jo o o 


1 
= | 4y[30 — VP] dy = 5 
0 
3.19. The joint pdf of a bivariate r.v. (X, Y) is given by 





kxy O<x<y<l 
fois y) = M otherwise 
where & is a constant. 


(a) Find the value of k. 
(b) Are X and Y independent? 


(a) The range space Йу, is shown in Fig. 3-6. By Eq. (3.26), 


= © 1 ру 1 fy? 
{ { ди ds dy =k | Poet DS 
cx; т x, о О o 2 


1 3 
y k 
=k — dy=-= 1 
rs Y7g 
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Fig. 3-6 


Thus k = 8. 
(b) By Eq. (3.30), the marginal pdf of X is 
1 
8xy dy = 4x(1 — x?) 0cx«l 
f(x) = | 
0 otherwise 


By Eq. (3.31), the marginal pdf of Y is 
y 
[sx ax c p O<y<l 


0 
0 otherwise 


fy = 


Since fyy(x, y) # /у(х) f(y), X and Y are not independent. 
Note that if the range space R yy depends functionally on x or y, then X and Y cannot be indepen- 
dent r.v.'s. 


3.20. The joint pdf of a bivariate r.v. (X, Y) is given by 
k О<у<х<1 
Љу у) = f otherwise 
where k is a constant. 


(a) Determine the value of k. 

(b) Find the marginal pdf’s of X and Y. 

(с) Find PO < X <4,0< Y < 1). 

(a) The range space Ryy is shown in Fig. 3-7. By Eq. (3.26), 


F Г fy, у) dx dy = k [|| dx dy = k x area(Ryy) = КЗ) = 1 


Rxy 


Thus k = 2. 
(b) By Eq. (3.30), the marginal] pdf of X is 


2 dy = 2x О<х<1 
А0) = [ 


0 otherwise 
By Eq. (3.31), the marginal pdf of Y is 


1 
fly) = [24-2075 О<ус<1 


0 otherwise 
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Fig. 3-7 


The region in the xy plane corresponding to the event (0 < X < 4, 0 < Y < 4) is shown in Fig. 3-7 as 
the shaded area R,. Then 


PO<X <4,0<¥<}=P0<X¥ <4,0<Y<X) 
= {| Seylx, у) dx dy = 2 {| dx dy = 2 x area(R,) = X) = 4 
В, К; 
Note that the bivariate г.у. (X, Y) is said to be uniformly distributed over the region Күү if its pdf is 


k (x, y) € Ry, 


Jus y) = i (3.96) 


otherwise 


where k is a constant. Then by Eq. (3.26), the contant k must be k = 1/(area of Ryy). 


Suppose we select one point at random from within the circle with radius R. If we let the center 
of the circle denote the origin and define X and Y to be the coordinates of the point chosen (Fig. 
3-8), then (X, Y) is a uniform bivariate r.v. with joint pdf given by 


x? + y? < К? 


k 
fy. y) = lo x? + y? > R? 


where k is a constant. 


(a) 
(b) 
(с) 


Determine the value of К. 
Find the marginal pdf’s of X and Y. 


Find the probability that the distance from the origin of the point selected is not greater 
than a. 


Fig. 3-8 
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(а) Ву Eq. (3.26), 


Í | Г Лх, y) dx dy = К [|| dx dy = k(R?) = 1 


x2+yl<R2 
Thus k = I/zR?. 
(b) By Eq. (3.30), the marginal pdf of X is 
1 RI х1) 4 
=— dy = 5 /R?-x? х < R? 
Љо) nR? <- ART- х1 y nR? * * 
2 
— R? _ ү? < 
Непсе Лх) = (aR? х IxISR 
0 Ix| > К 


By symmetry, the marginal pdf of Ү is 
Sy) = aR? 


(с) ForO<a<R, 


P(X? + Y? <a)= {| Хбх, y) dx dy 


x2+y2 sal 


1 na? a? 
=— d f „^5 
nR? {| x dy nR? К? 


x21 y2sa2 


3.22. The joint pdf of a bivariate r.v. (X, Y) is given by 


fx. y) ke (ex tby) x>0,y>0 
X, = . 
xv у 0 otherwise 


where a and b are positive constants and К is a constant. 


(a) Determine the value of k. 
(b) Are X and Y independent? 


| | Spx, y) dx dy =k | | g ***» dx dy 
-a J-a o do 


2 © k 
-*| etas е7» Ду = — = 1 
b o ab 


(а) By Eq. (3.26), 


Thus k = ab. 
(b) By Eq. (3.30), the marginal pdf of X is 


E: 


е ду = ае " x>0 


Ах) = abe ** | 


О 


Ву Eq. (3.31), the marginal pdf of Y is 


A) = are | e " dx = he ™ у> 0 


a 


Since fy (x. y) = f(x) Ру), X and Y are independent. 
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3.23. А manufacturer has been using two different manufacturing processes to make computer 
memory chips. Let (X, Y) be a bivariate r.v., where X denotes the time to failure of chips made 
by process A and Y denotes the time to failure of chips made by process B. Assuming that the 
joint pdf of (X, Y) is 


abe" 9+5) x>0, y>0 
fas у) = f otherwise 


where a = 1074 and b = 1.2(107*), determine Р(Х > Y). 


The region in the xy plane corresponding to the event (X > Y) is shown in Fig. 3-9 as the shaded area. 
Then 


nx > ¥) = ab | 


o 


= ab { e 9 | e a| dx=a | e^ (1 — e 9) dx 
0 o o 


b 1.2(107 5) 


° [ее dy ax 
o 


= =^ = 0545 
a+b (1-12X1075) 
y 
у=х 
Xx 
0 
Fig. 3-9 


3.24. A smooth-surface table is ruled with equidistant parallel lines a distance D apart. A needle of 
length L, where L x D, is randomly dropped onto this table. What is the probability that the 
needle will intersect one of the lines? (This is known as Buffon's needle problem.) 


We can determine the needle's position by specifying a bivariate r.v. (X, ©), where X is the distance 
from the middle point of the needle to the nearest parallel line and © is the angle from the vertical to the 
needle (Fig. 3-10). We interpret the statement “the needle is randomly dropped" to mean that both X and © 
have uniform distributions and that X and © are independent. The possible values of X are between 0 and 











Fig. 3-10  Buffon's needle problem. 
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D/2, and the possible values of © are between 0 and 2/2. Thus, the joint pdf of (X, ©) is 


4 


/хе(х, ©) = fx) fol) = лр 
0 


otherwise 


0<xs5,0<65 


[CHAP 3 


From Fig. 3-10, we see that the condition for the needle to intersect a line is X « L/2 cos 6. Thus, the 


probability that the needle will intersect a line is 


L nj2 (Lj 2)cos 0 
p(x < Ecos) = f { 
2 iu О 





fxe(x. 0) dx dé 


4 2 (L/2)cos 0 
SP Je 
4 FPL 2L 
=> | 5 0086 dô = zb 
CONDITIONAL DISTRIBUTIONS 
3.25. Verify Eqs. (3.36) and (3.41). 
(a) By Eqs. (3.33) and (3.20), 
У Рух(х;, у) 
Y риу) = 2 = PAD | 
yj x(xi) px(xj) 
(b) Similarly, by Eqs. (3.38) and (3.30), 
© Јух(х, y) dy Рх) 
aoe Ny 
im тюш = ЛО) 
3.26. Consider the bivariate г.у. (X, Y) of Prob. 3.14. 


(a) Find the conditional pmf's py(y;| xi) and pyyy(x;| y). 
(b Find P(Y = 2|X = 2) апа P(X = 2]|Ү = 2). 


(а) From the results of Prob. 3.14, we have 


Pxy(Xi, у) = f 


тї(2х + Ур) 


otherwise 


Px(x;) = (4x; + 3) х= 1, 2 
Pry) = (2y; +6) yj— 1,2 


Thus, by Eqs. (3.33) and (3.34), 


2x; + yj 
Perl x) = 13 yj — 1,2; x; 2 1,2 
2x; + yj 
|уу=—————4 ‚=1,2;у;=1,2 
Рууу(Х(‹| Уу) 2y; +6 Xi У; 


(b) Using the results of part (a), we obtain 


P(Y = 2|X 22) = pyy(2]2) = 


P(X = 2] Y = 2) = pyy(2|2) = 


2(2) + 2 
4(2) + 3 
2(2) + 2 
2(2) + 6 








6 


1 
3 
5 


1 


х=1,2;у,= 1,2 
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3.27. Find the conditional pmf's рууу(у,| xj) and py,y(x;| yj) for the bivariate r.v. (X, Y) of Prob. 3.15. 


From the results of Prob. 3.15, we have 


1.2 
wrx Y = 1,2; уу = 1,2,3 
Xi, у) = . 
Poss, yj) f otherwise 


Thus, by Eqs. (3.33) and (3.34), 


35x y; 
Prol x) = MÀ. = iy у= 1. 2,3; х= 1, 2 
aX 
ху, 
paly) = atx? = 1,2; уу 1,2,3 
$Yj 


Note that Ру у{у;| х) = py(y) and рух: y) = Px(xj as must be the case since X and Y are independent, 
as shown in Prob. 3.15. 


3.28. Consider the bivariate r.v. (X, Y) of Prob. 3.17. 
(a) Find the conditional pdf’s fy,x(y| x) and у(х | y). 
(b Find P(O < Y <4|X = 1) 


(a) From the results of Prob. 3.17, we have 


1 
|Ofáx-» 0<х<2,0<у<2 
fx. »- otherwise 
h= O0«x«2 


)=al 
Лу)=ду +1) 0<у<2 
Thus, by Eqs. (3.38) and (3.39), 


PRX 1х+у 
furl “тту” 





0<х<2,0<у<2 


(х +1) 
fai = SE 1x» 


- 0<х<2,0 < 2 
1y-0) 2у+1 х ys 





(b) Using the results of part (a), we obtain 


1/2 1 1/2 l+y 5 
P(0 lix21)- =1)=- —— | 4у = — 
(0 « Y «3l ) | Лх (x = 1) Í ( 2 ) у= 35 


3.29. Find the conditional pdf's fy x(y | x) and /, у(х | у) for the bivariate г.у. (X, Y) of Prob. 3.18. 
From the results of Prob. 3.18, we have 
_ }4ху О0<х<1,0<у<1 
Sar у) = lo otherwise 
fx(x) = 2x О<х<1 
АЉ) = 22у 0<у<1 


Thus, by Eqs. (3.38) and (3.39), 


4 


faxo | x) = ХУ = 2y 0<у<1,0<х<1 


2x 
4xy 
Sal y = ZO = 2x О<х<1,0<у<1 
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Again note that fy;x(v |x) = f(y) and fyyy(«| у) = f(x), as must be the case since X and Y are independent, 
as shown in Prob. 3.18. 


3.30. Find the conditional pdf's fj (y | x) and fyjy(x | y) for the bivariate r.v. (X, Y) of Prob. 3.20. 
From the results of Prob. 3.20, we have 


2 О<у<х<1 
fais, 9) = t otherwise 
fx) = 2x О<х<1 


Лу) =2(1—у 0<у<1 
Thus, by Eqs. (3.38) and (3.39), 


1 
Љу) => у<х<1,0<х<1 


I 
Surly) = т ysx<1Q<x<1l 


3.31. The joint pdf of a bivariate r.v. (X, Y) is given by 


1 
—e@ ery x>O0,y>0 
Sav, y) = {у , 

0 otherwise 
(a) Show that fy у(х, y) satisfies Eq. (3.26). 
(b Find P(X > |У = y) 


(a) We have 
© ос © “| 
| | лола = | | = ee"? dx dy 
-w Ј- к о Jo Y 





(b) First we must find the marginal pdf on Y. By Eq. (3.31), 


oo 


a | 
A) -| Руб, y) dx = y e^? | e? dx 


0 

х= %0 
=e? 

x=0 


1 
fne J-e*”  x>0,y>0 


1 -| _ 
=-e? — ye х/у 
у 


Ву Eq. (3.39), the conditional pdf of Х is 





/Дхү(Х | у) = 
f» 0 otherwise 
Then P(X »1|Y = у) = | Firs, у) dx = | y ех» dx 
1 1 
= —eg y ili = e V» 
x=] 
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COVARIANCE AND CORRELATION COEFFICIENTS 
3.32. Let (X, Y) be a bivariate r.v. If X and Y are independent, show that X and Y are uncorrelated. 
If (X, Y) is a discrete bivariate r.v., then by Eqs. (3.43) and (3.22), 
E(XY) = У У xiy Paris y) = У У xy; bx PY) 


y) Xi vp X 


= р р vp) | = E(X)E(Y) 
xi » 


If (X, Y) is a continuous bivariate r.v., then by Eqs. (3.43) and (3.32), 


E(XY)= | | | | ху/уү(х, y) dx dy = | | | xyfy(x)fy(y) dx dy 


- | у(х) dx | уу) dy = E(X)E(Y) 


» - 


Thus, X and Y are uncorrelated by Eq. (3.52). 


3.33. Suppose the joint pmf of a bivariate r.v. (X, Y) is given by 


(0, 1), (1, 9), (2, 1) 


i 
PxyGGi, Y) = $ otherwise 


(a) Are X and Y independent? 
(b) Are X and Y uncorrelated? 
(a) By Eq. (3.20), the marginal pmf’s of X are 
px(0) = У Pxy(0, у) = рхү(0, I) = 4 
у} 


px(1) = У PxY(l, y) = рхк(1, 0) = i 


рх(2) = Y рху(2, Y) = Pyy(2, 1) = i 


» 


By Eq. (3.21), the marginal pmf's of Y are 
py(0) = У Рхү(х;. 0) = pxy(1, 0) = 3 
х 


рү(1) = У, pxy(xi, D = pxy(0, 1) + рху(2, 1) = 3 


and Pxy(0, 1) = 3 # ру(0)ру(1) = $ 
Thus X and Y are not independent. 
(P) By Ед. (3.45a), (3.456), and (3.43), we have 


E(X) = Y, xi pix) = (0X4) + (X + (300 = 1 
E(Y) = У ууру) = (0) + (14) = 4 
У} 


E(XY) = У У Xi Y;PxyUs. yj) 


» x 
= (0X D) + (0X0) + (214) = 3 
Now by Eq. (3.51), 
Cov(X, Y) = E(XY) — E(X)E(Y) =} – (03) = 0 
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Thus, X and Y are uncorrelated. 


Let (X, Y) be a bivariate r.v. with the joint pdf 





2 2 
X ЖУ q-oreynn 


—0<х<оо,—о<у<со 
4т 


fuos у) = 


Show that X and Ү are not independent but are uncorrelated. 
By Eq. (3.30), the marginal pdf of X is 


1 a 
Ло) = = | (x? + удете 5902 dy 





e Y? dy + F l 267212 iy) 


e 212 a 1 
= — |x? y 
2./2n -x 2a -» у 21 
Noting that the integrand of the first integral in the above expression is the pdf of №0; 1) and the second 
integral in the above expression 15 the variance of N(Q; 1), we have 


(x? + Dew??? -0 <х< о 


Лх) = 





І 
2 /2n 


Since fyy(x, y) is symmetric in x and y, we have 





(y? + New? —<у<оо 


1 
ЛО) = 
Uo Mn 
Now fyy(x, y) x /у(х) f(y), and hence X and Y are not independent. Next, by Eqs. (3.47a) and (3.47b), 
E(X) = | xfy(x) dx = 0 


7% 


E(Y)= | Жубу) dy = 0 


-= 


since for each integral the integrand is an odd function. By Eq. (3.43), 


E(XY) = f f xVfxy(x, y) dx dy = 0 


The integral vanishes because the contributions of the second and the fourth quadrants cancel those of the 
first and the third. Thus, E(X Y) = E(X)E(Y), and so X and Y are uncorrelated. 


Let (X, Y) be a bivariate r.v. Show that 
[E(X Y)]? x Е(Х?ЈЕ(Ү?) (3.97) 
This is known as the Cauchy-Schwarz inequalit y. 


Consider the expression E[(X — «Y)?] for any two r.v.s X and Y and a real variable a. This expres- 
sion, when viewed as a quadratic in о, is greater than or equal to zero; that is, 


E[(X – aY?] > 0 
for any value of х. Expanding this, we obtain 
E(X?) — 2aE(X Y) + «?Е(Ү?) > 0 
Choose a value of « for which the left-hand side of this inequality is minimum, 


_ E(XY) 
— EY?) 





a 
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which results in the inequality 


А 2 
LEX YW o ог  [E(XY)P < E(X^)EY?) 


E(X?) — AY) ? 


3.36. Verify Eq. (3.54). 
From the Cauchy-Schwarz inequality [Eq. (3.97)], we have 
{E[(X — рУ — ау) < ERX — py) JERY — n] 





or Oxy? € 0x7 ay" 
2 
Oxy 
Then px? = 55 < 1 
9x Oy 


Since руу is a real number, this implies 


lpxyl < 1 or —1 pyy <1 


3.37. Let (X, Y) be the bivariate r.v. of Prob. 3.12. 
(a) Find the mean and the variance of X. 
(b) Find the mean and the variance of Y. 
(c) Find the covariance of X and Y. 
(d) Find the correlation coefficient of X and Y. 
(a) From the results of Prob. 3.12, the mean and the variance of X are evaluated as follows: 
E(X) = У xi px(x) = (00.5) + (10.5) = 0.5 
E(X?) = 5, xi px(Xx;) = (0) (0.5) + (1)7(0.5) = 0.5 
ах? = EC) — [E( X) = 0.5 — (0.5? = 0.25 

(b) Similarly, the mean and the variance of Y are 


Е(Ү) = Y. y;py(y) = (000.55) + (00.45) = 0.45 


» 


ECY?) = У, y py(y) = (0)(0.55) + (1?(0.45) = 0.45 


» 
oy? = E(Y?) — [E(Y)]? = 0.45 — (0.45)? = 0.2475 
(c) By Eq. (3.43), 
E(XY) = Y Y Xi yjPxy Gi. yj) 


Ур xi 
= (0X0)(0.45) + (010.05) + (100.1) + (1)(1)(0.4) 
= 0.4 
By Eq. (3.51), the covariance of X and Y is 
Cov(X, Y) = E(XY) — E(X)E(Y) = 0.4 — (0.50.45) = 0.175 
(d) By Eq. (3.53), the correlation coefficient of X and Y is 
Cov(X, Y) _ 0.175 


xy = = = 
А agay 0250302475) 


0.704 





3.38. Suppose that a bivariate r.v. (X, Y) 1s uniformly distributed over a unit circle (Prob. 3.21). 
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(a) Are X and Y independent? 
(b) Are X and Y correlated? 


(a) Setting R = 1 in the results of Prob. 3.21, we obtain 


х® + у? < 1 
/ху(х, у) = 


coal 


х +у?> 1 

fi) == JT Ix| <1 
2 

fi - - VI -y |y«! 


Since fyy(x, y) # /у(х)/у(у), X and Y are not independent. 
(b) By Eqs. (3.47a) and (3.47b), the means of X and Y are 


1 

s»: | x/l — x? dx 20 
-1 
1 

gn» i| y/l-ydy-0 
-1 


since each integrand is an odd function. 
Next, by Eq. (3.43), 


BIXY) =~ Il xy dx dy 0 


x2ty2< 1 


The integral vanishes because the contributions of the second and the fourth quadrants cancel those of 
the first and the third. Hence, E(X Y) = E(X)E(Y) = 0 and X and Y are uncorrelated. 


CONDITIONAL MEANS AND CONDITIONAL VARIANCES 


3.39. Consider the bivariate r.v. (X, Y) of Prob. 3.14 (or Prob. 3.26). Compute the conditional mean 
and the conditional variance of Y given x; — 2. 


From Prob. 3.26, the conditional pmf py,,(y,| xj) is 


2x; + yj 
Риу х) = Fo y= l,2;x;= 1,2 
4+ у; 
Thus, Рих(у312) T ур= 1,2 


and by Eqs. (3.55) and (3.56), the conditional mean and the conditional variance of Y given x; = 2 are 





4+y, 
ШҮ; = E(Y |x; = 2) = У YjPyix G7] 2) = У vf 11 2 
vs yj 


sd de esa) 
GIG) 
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3.40. Let (X, Y) be the bivariate r.v. of Prob. 3.20 (or Prob. 3.30). Compute the conditional means 
E(Y | x) and E(X | y). 


From Prob. 3.30, 
1 
Ml =z у<х<1,0<х<1 
1 
Sxl) = т у<х<1,0<х<1 


By Eq. (3.58), the conditional mean of Y, given Х = x, is 














> * (1 yb x 
ayine | vari = D) => = > 0<х<1 
=% о NX 2х ly-o 2 
Similarly, the conditional mean of X, given Y = y, is 
o 1 1 x? x=1 l+y 
E(X|y) = MN dx - | 4; 1) ах = Xi —y |e, =z 0<у<1 


Note that E(Y | x) is a function of x only and E(X | y) is a function of y only. 


3.41. Let (X, Y) be the bivariate r.v. of Prob. 3.20 (or Prob. 3.30). Compute the conditional variances 
Var(Y | x) and Var(X | y). 


Using the results of Prob. 3.40 and Eq. (3.59), the conditional variance of Y, given X — x, is 
ac X 2 
Var(Y|x) = E{LY — E(Y beo |x} = | (» - z) Sixty x) dy 


bam?) 


-[6-30-z6-3 


Similarly, the conditional variance of X, given Y = y, is 


у=х х 


o 12 





у= 


А a I+y 2 
Var(X |y) = E([X — E(X |y? ly} = х5) forix ly) dx 


7 [ ( 7 (5) "xi = (s 7 +) 


N-DIMENSIONAL RANDOM VECTORS 


3.42. Let (X,, X2, X3, X4) be a four-dimensional random vector, where X, (k = 1, 2, 3, 4) are inde- 
pendent Poisson r.v.’s with parameter 2. 


(а) Find P(X, = 1, X;23, X422, X, = 1). 
(b) Find the probability that exactly one of the X,’s equals zero. 
(a) Ву Eq. (2.40), the pmf of X, is 


"7! (0-yy 


12 








x-y 


рй = P(X, =) e? i=0,1,... (3.98) 


i} 
Since the X,'s are independent, by Eq. (3.80), 
P(X, = 1, Х = 3, X, =2,X,= 1) = Px ())Px,(3)Px,(2)px,{1) 
e 22V e ?22NV e ?2? M e7?2 e ?27 m 
-( 1! X 3! Y 2! \ 1! )- jp © 35810") 
(b) First, we find the probability that X, = 0, k = 1, 2, 3, 4. From Eq. (3.98), 
P(X, =0)=e7? k=1,2,3,4 
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Next, we treat zero as “success.” If Y denotes the number of successes, then Y is a binomial r.v. with 
parameters (n, р) = (4, e` 2). Thus, the probability that exactly one of the X,'s equals zero is given by [Eq. 
(2.36)] 


Р(Ү = 1) = (Dec —e 7? x 0,35 


3.43. Let (X, Y, Z) be a trivariate r.v., where X, Y, and Z are independent uniform r.v.'s over (0, 1). 
Compute P(Z > XY). 


Since X, Y, Z are independent and uniformly distributed over (0, 1), we have 
Preval, y, 2) = Ју) fy zz) = 1 О<х<1,0<у<],0<г:<1 


1 1 1 
Then Р(2 > ХҮ) = [|| Fev, y, z) dx dy dz = | | | dz dy dx 
0 0 Xy 


2> ху 


Qf | x 3 
= ü so) dy dx = | ( je- 
MI , А 2 4 


3.44. Let (X, Y, Z) be a trivariate г.у. with joint pdf 


Ке бах byt са) х> 0, у> 0,2> 0 


Ју У 2) = f 





otherwise 
where a, b, с> О and k are constants. 


(a) Determine the value of K. 

(b) Find the marginal joint pdf of X and Y. 
(c) Find the marginal pdf of X. 

(d) Are X, Y, and Z independent? 

(a) By Eq. (3.76), 


f f f | Fxyz y, z) dx dy dz=k f | f grex thy tea) dx dy dz 
2x doa dex о Jo Jo 
x -ax a -by ^ _ К 
=k] ег ах | ет» ау | e77 dz = — = 1 
о o o abc 
Thus k = abc. 


(b) By Eq. (3.77), the marginal joint pdf of X and Y is 


/уү(х, У) = | fuy2lX, y. 2) dz = abc | eg axtbyren) dz 


0 
= арсе “х | e * dz = abet» х> 0, у> 0 
0 
(c) By Eq. (3.78), the marginal pdf of X is 


(e | | ий у. 2) dy de = abe | [етеш 
-a Jax o 


0 
= abee | e of e" dz=ae *  x»0 
0 o 
(d) Similarly, we obtain 


Sly) = | | tev y. z) dx dz 2» be ® у>б 
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3.45. 


fz) = | | Худ у, 2) dx dy = се © z>0 


Since f, yz(x, у, 2) = /у(х)/у(у)/ (2), X. Y, and 2 are independent. 


Show that 
Ју у, 2) = fzx. |х, Уху | x) fx(x) (3.99) 
By definition (3.79), 
_ Sxy2lX y, 2) 
Јах. vz 1x, y) fes y) » 
Hence Suv2l y, 2) = Јах. v2 1x у)/хү(Х, y) (3.100) 


Now, by Eq. (3.38), 
Sur y) = Јуху |х) (х) 
Substituting this expression into Eq. (3.100), we obtain 


fay2lx, y, 2) = fzix. (Z| x, PV Srp ХӘЛ) 


SPECIAL DISTRIBUTIONS 


3.46. 


3.47. 


Derive Eq. (3.87). 


Consider a sequence of n independent multinomial trials. Let A; (i = 1, 2, ..., К) be the outcome of a 
single trial. The r.v. X; is equal to the number of times А, occurs in the n trials. If x,, x3, ..., x, are 
nonnegative integers such that their sum equals n, then for such a sequence the probability that 4; occurs x; 
times, i= 1, 2,..., k—that is, Р(Х, = ху, X, = x,,..., X, = x,)—can be obtained by counting the number 
of sequences containing exactly x, 4,'s, x; 4,5, ..., x, A,’s and multiplying by p,"p;" +++ рх. The total 
number of such sequences is given by the number of ways we could lay out in a row n things, of which x, 
are of one kind, x; are of a second kind, ..., x, are of a kth kind. The number of ways we could choose x, 


positions for the A,’s is ( ) after having put the А, 55 in their position, the number of ways we could 
xy 


n Qo [nX . А 
choose positions for the A's is ( ) and so on. Thus, the total number of sequences with x, A,’s, x2 
X2 





A,’S,..., X, А, 815 given by 
(my ( -- x; = Хх = 1) 
Xi Xz X4 X, 
n! (n — x,)! (и = ху хә = x24)! 
хуи xy)! хи — x, -- x,)! x, 10! 
п! 
ERER ex! 
Thus, we obtain 
n! x x Xj 
Pxix; xe Хз, o X) = ху! ху! x 7 Pi P2 pep 


Suppose that a fair die is rolled seven times. Find the probability that 1 and 2 dots appear twice 
each; 3, 4, and 5 dots once each; and 6 dots not at all. 


Let (X p X5. ..., X 4) be a six-dimensional random vector, where X, denotes the number of times i dots 
appear in seven rolls of a fair die. Then (Х,, X ;, ..., Xe) isa multinomial r.v. with parameters (7, Pis рз, ..., 
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рь) where р, = & (i = 1, 2, ..., 6). Hence, by Eq. (3.87), 


7! IV AVION ANON (a 
PX, =2, X, = X, = LX, = 1, Xs = 1, Хв 0) = то :) YOG ;) 6 


т! AY 
=~ (5) = a x 0.0045 





6 


Show that the pmf of a multinomial r.v. given by Eq. (3.87) satisfies the condition (3.68); that is, 
УУ У рих, (х х2) = 1 (3.101) 


where the summation is over the set of all nonnegative integers ху, x2, ..., x, whose sum is n. 


The multinomial theorem (which is an extension of the binomial theorem) states that 
n 
(a, +а, ++ ау = У ( Jaran egy (3.102) 
хх) ry 
where X, +x, b +x, = папі 


( n ) n! 
ee ~ ex! 
х\Х› X, xi! x! ху! 


is called the multinomial coefficient, and the summation is over the set of all nonnegative integers x,, x5, ..., 
x, whose sum is n. 
Thus, setting a; = p, in Eq. (3.102), we obtain 


yy Y Pos Xj, es X) = (р + р; +++ pl = (1) = 1 


Let (X, Y) be a bivariate normal r.v. with its pdf given by Eq. (3.88). 


(a) Find the marginal pdf’s of X and Y. 
(b) Show that X and Y are independent when p = 0. 
(a) By Eq. (3.30), the marginal pdf of X is 


/х(х) = | /ху(х, y) dy 
From Eqs. (3.88) and (3.89), we have 


1 1 
fox. у = noroi — py Е 5 q(x, »| 


-l х-к х=юих\Му-ш\ (у-ү 
axs) =| Oy ) = 20( Ox X Oy )+( бү J 


Rewriting q(x, y), 
1 -u x-uxVP (х-к 
dix. y) = —— [(° ) ~ of 3l + ( i 
—p бү Ox ox 
1 бү 2 X — Hx 2 


1 х= ах \> 
PL 540 = І І 
Then х) = ——=—————*—— -= Е 5 q(x, »] dy 


noy -% 2л ey = pi? 


1 с 2 
— |v -uy — 8 — (x — ux) 
(1 — pay? yo tye 9x T 








where q(x, y) = 
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Comparing the integrand with Eq. (2.52), we see that the integrand is a normal pdf with mean 





By + р(су/схХх — их) and variance (1 — p?)o,?. Thus, the integral must be unity and we obtain 
1 —(x — uy)? 
fien = |687) (3.103) 
J 2n ox 20x 


In a similar manner, the marginal pdf of Y is 





Му) = epf E] (3.104) 


(b) When p = 0, Eq. (3.88) reduces to 


_ 1 I[fx—-uxV y-umyV 
fa D= дуа; af- 3f Ox ) «( бү )j 


"As |S") |o 80625) | 
sed (( ex jme, PL 2\ ду 
= fx) Љу) 


Hence, Х and Y are independent. 

















3.50. Show that p in Eq. (3.88) is the correlation coefficient of X and Y. 
By Eqs. (3.50) and (3.53), the correlation coefficient of X and Y is 


oN) 
- F = - any er y) dx dy (3.105) 


where fy,(x, y) is given by Eq. (3.88). By making a change in variables v = (x — py)/ox and w = (y — py)/oy, 
we can write Eq. (3.105) as 


"7 2n(1 — mina o - a <р? - 2m tw | d dw 


exp Le pur dv oe "^? dw 
-| 7 OUAIS п Ф| 7 3 Lg 


The term in the curly braces is identified as the mean of V = N(pw; 1 — p°), and so 


e? dw 





(ође */2 о | и? 


1 
Pxy = UR u Jn 


The last integral is the variance of W = N(0; 1), and so it is equal to 1 and we obtain руу = p. 


3.51. Let (X, Y) be a bivariate normal r.v. with its pdf given by Eq. (3.88). Determine E(Y | x). 


By Eq. (3.58), 
arw- |7 Уху) dy (3.106) 
where Syixy lx) = е» (3.107) 


Substituting Eqs. (3.88) and (3.103) into Eq. (3.107), and after some cancellation and rearranging, we obtain 


1 1 сү 2 
Pax) nod — ppya P? ^30 -p)5p ? "S — Hx) — Hy 
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which is equal to the pdf of a normal r.v. with mean py + p(oy/o Xx — их) and variance (1 — р2)су?. Thus, 
we get 


E(Y |x) = by t7 (~~ My) (3.108) 
x 


Note that when X and Y are independent, then p = 0 and E(Y | х) = py = E(Y). 


The joint pdf of a bivariate r.v. (X, Y) is given by 





1 
(х, у) = ех ЕЕ +у?+х—2 +D] -O <x, y < 
Љу "NP p-3 yty y y 


(a) Find the means of X and Y. 
(b) Find the variances of X and Y. 
(c) Find the correlation coefficient of X and Y. 


We note that the term in the bracket of the exponential is a quadratic function of x and y, and hence 
fxy(x. y) could be a pdf of a bivariate normal г.у. If so, then it is simpler to solve equations for the various 
parameters. Now, the given joint pdf of (X, Y) can be expressed as 


1 
Губ y) = WP оч] - 24(х, »| 
where dx, y) = 4x? — xy +y? +x- 2p 41) 
= 3[x? — x(y — 1) + (y — 17] 


Comparing the above expressions with Eqs. (3.88) and (3.89), we see that fxy(x, y) is the pdf of a bivariate 
normal r.v. with uy = 0, uy = 1, and the following equations: 


nox Gy / 1 — р? = 2 f 3n 


(1 — pox = (1 — p?)oy? = 3 








Solving for oy’, ay”, and p, we get 


Hence 

(a) The mean of X is zero, and the mean of Y is 1. 
(b) The variance of both X and Y is 2. 

(c) The correlation coefficient of X and Y is 4. 


Consider a bivariate r.v. (X, Y), where X and Y denote the horizontal and vertical miss dis- 
tances, respectively, from a target when a bullet is fired. Assume that X and Y are independent 
and that the probability of the bullet landing on any point of the xy plane depends only on the 
distance of tbe point from the target. Show that (X, Y) is a bivariate normal r.v. 


From the assumption, we have 


/ху(х, y) = fae fy) = glx? + у?) (3.109) 
for some function g. Differentiating Eq. (3.109) with respect to x, we have 
POOL AY) = 2ха'(х* + у?) (3.110) 


Dividing Eq. (3.110) by Eq. (3.109) and rearranging, we get 
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fi) gon + у?) 
2xfelX) д y) 


Note that the left-hand side of Eq. (3.111) depends only on x, whereas the right-hand side depends only on 
x? + y^; thus 





(3.111) 





x(x) 
ae =с (3.112) 
X 
where c is a constant. Rewriting Eq. (3.112) as 
, а 
n = cx or dx [In fi(x)] = ex (3.113) 
X ; 


and integrating both sides, we get 


In f(x) = 5 х +a ог fax) = ke? 


where а and k are constants. By the properties of a pdf, the constant с must be negative, and setting с = 
— l/c?, we have 
Sele) = ке 7100 
Thus, by Eq. (2.52), X = №0; с?) and 
] 
Лб) = 
* V 2ле 


In a similar way, we can obtain the pdf of Ү as 


= x2/(262) 





е 


1 


-у20202) 





ЛО) = 


е 
то 


Since X and Y are independent, the joint pdf of (X, Y) is 





1 g 02 +901002) 
2nc? 


Луб, y) = feo) = 


which indicates that (X, Y) is a bivariate normal r.v. 


Let (X,, X5, ..., X,) be an n-variate normal r.v. with its joint pdf given by Eq. (3.92). Show that 
if the covariance of X; and X; is zero for i з j, that is, 


CowX,, X) aè i=j (3.114) 
VI i, J = Oij = . А . 
Ц 1 1 0 i xj 
then X ,, X;,..., X, are independent. 
From Eq. (3.94) with Eq. (3.114), the covariance matrix K becomes 
c? 0 - 0 
0 oj? > 0 
К=|, о, . (3.115) 
0 0 o, 
It therefore follows that 
[det K|"? = сс о, = [о (3.116) 


апа 
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1 
— 0 > 0 
01 
1 
K!- 0 — 0 
92 
1 
0 0 —; 
e, 


Then we can write 


n — un? 
(х "Кх в) = Y (==) 
izi т 


Substituting Eqs. (3.116) and (3.118) into Eq. (3.92), we obtain 





1 n А 
ЛХ Xa) = т — Е 5 У ( 
2 i=. Ti 
Qr)" (fs) 
Now Eq. (3.119) can be rewritten as 


хуз xl Xt Ха) = [L9 
і=1 





where 
1 - 
fx(x) = eT aman 
| J 2 92, 
Thus we conclude that X,, X;,..., X, are independent. 


Supplementary Problems 


[CHAP 3 


(3.117) 


(3.118) 


(3.119) 


(3.120) 


Consider an experiment of tossing a fair coin three times. Let (X, Y) be a bivariate r.v., where X denotes the 


number of heads on the first two tosses and Y denotes the number of heads on the third toss. 


(a) Find the range of X. 
(b) Find the range of Y. 
(c) Find the range of (X, Y). 
(d Find (i) P(X < 2, Y < 1); (ii) P(X < 1, Y < 1); and (iii) P(X <0, Y < 0). 
Ans. (a) Ry = {0, 1, 2} 
(b) Ry = {0, 1} 


(с) Куу = {(0, 0), (0, 1), (1, 0), (1, 1), (2, 0), (2, 1)} 
(d) (0 РХ<2, У<1) = 1; (i) PIX <1,Y <1 


Let F,,(x, y) be a joint cdf of a bivariate r.v. (X, Y). Show that 
P(X >a, Y > с) =1— Fy(a)— Fy(c) + Fyy(a, с) 


where F(x) and F,(y) are marginal cdf's of X and Y, respectively. 


Hint: Set x, =a, y, = c, and x, = y, = оо in Eq. (3.95) and use Eqs. (3.13) and (3.14). 


) = ł; and (iii) P(X x0, Y x0) = 1 
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3.57. 


3.58. 


3.59. 


3.60. 


3.61. 


Let the joint pmf of (X, Y) be given by 


k(x, + у) x; = 1, 2,3; y;=1,2 
0 otherwise 


Byy Xi, yp = l 


where k is a constant. 


(a) Find the value of k. 
(b) Find the marginal pmf’s of X and Y. 


Ans. (а) ke 
(b ру(хд= з (2х+3) x;21,2,3 


pry) = (6 + 33) — yj- 1, 2 


The joint pdf of (X, Y) is given by 


ke 9*9 x>0,y>0 


Лху(Х, y) = t 


otherwise 
where k is a constant. 


(a) Find the value of k. 
(b) Find P(X > 1, Y <1), P(X < Y), and P(X x 2). 


Ans (a) k=2 


(b) P(X >t, ¥ <t)se7! — e~? 0.318; P(X < Y)=4; PIX < 2) = 1 — e`? æ 0.865 
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Let (X, Y) be a bivariate r.v., where X is a uniform r.v. over (0, 0.2) and Y is an exponential r.v, with 


parameter 5, and X and Y are independent. 
(a) Find the joint pdf of (X, Y). 
(b Find P(Y < X). 


2540? 0<х<02,у> 0 
Ans. (a) Ан J) = le otherwise 


(b) PUY < X) = e~} а 0.368 


Let the joint pdf of (X, Y) be given by 


7+0 > 0,у> 0 
fax Ў = pe NOM 
0 otherwise 


(a) Show that f,,(x, y) satisfies Eq. (3.26). 
(b) Find the marginal pdf’s of X and Y. 


Ans, (b }у(х)=е * х> 0 
Му) = у> 0 





(у + 1)? 


The joint pdf of (X, Y) is given by 


kx'(A—y) x<y<2x,0<x <2 
0 otherwise 


Луб, y) = | 


where k is a constant. 


(a) Find the value of k. 
(b) Find the marginal pdf’s of X and Y. 


Ans, (а) к= $ 
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(b) fx) = x74 — 3x) O<x<2 
Gpz(4 — yy 0<у<2 

ЖО) = SG — УХ8 – Фу) 2 <у<4 

0 otherwise 


The joint pdf of (X, Y) is given by 


_ fxye 02 x > 0, у> 0 
far, У) = t otherwise 
(а) Find the marginal pdf's of X and Y. 
(b) Are X and Y independent? 
Ans (а) Р(х) = xe 77  x»0 

Л) = ye"? y>0 

(b) Yes 
The joint pdf of (X, Y) is given by 
e ot» х> 0, у> 0 
Ља У) = M otherwise 


(a) Are X and Y independent? 
(b) Find the conditional pdf's of X and Y. 


Ans. (a) Yes 


(b fayxly)— e? х> 0 
Љу) = e у> 0 


The joint pdf of (X, Y) is given by 
e O<x<y 
Ља у = f otherwise 
(a) Find the conditional pdf’s of Y, given that X = x. 
(b) Find the conditional cdf’s of Y, given that X = x. 


Ans. (а) fgx(ylx) e" у>х 
0 
6) Fyx(ylx) = t rsx 


у> х 


— gx 


Consider the bivariate r.v. (X, Y) of Prob. 3.14. 
(a) Find the mean and the variance of X. 
(b) Find the mean and the variance of Y. 
(c) Find the covariance of X and Y. 
(d) Find the correlation coefficient of X and Y. 
Ans. (а) E(X) = 29, Var(X) = xa 

(b EY) = 5$, Var(Y) = 4f 

(с) Cov(X, Y)= – ті 

(d p= – 0.025 


Consider a bivariate г.у. (Х, Y) with joint pdf 
1 


Qe (88+ 979/202) 
2nc 


fox, y) = 





—ос<х,у<‹ 


Find P[(X, Y)| x? + y? x a?]. 
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3.67. 


3.68. 


3.69. 


Ans. ]—e "Qe 


Let (X, Y) be a bivariate normal r.v, where X and Y each have zero mean and variance c?, and the 
correlation coefficient of X and Y is p. Find the joint pdf of (X, Y). 
1 1 x? – 2pxy + y? 
Ans. Y= expl - = OS 
п5. Sxy(% у) 2no?(1 — р?)!? o| 2 ol- p?) 


The joint pdf of a bivariate r.v. (X, Y) is given by 
1 
Sav, У) = Jin Е $08 – xy + »»| 
л 


(а) Find the means and variances of X and Ү. 
(b) Find the correlation coefficient of X and Y. 


Ans. (а) py = ру = 0 e) = оү? = 1 


(b) р= } 


Let (X, Y, 2) be a trivariate r.v., where X, Y, апа 2 are independent and each has a uniform distribution 
over (0, 1). Compute P(X > Y > 2). 


Ans. i 


Chapter 4 


Functions of Random Variables, Expectation, 
Limit Theorems 


41 INTRODUCTION 


In this chapter we study a few basic concepts of functions of random variables and investigate the 
expected value of a certain function of a random variable. The techniques of moment generating 
functions and characteristic functions, which are very useful in some applications, are presented. 
Finally, the laws of large numbers and the central limit theorem, which is one of the most remarkable 
results in probability theory, are discussed. 


42 FUNCTIONS OF ONE RANDOM VARIABLE 
A. Random Variable g(X): 
Given a r.v. X and a function g(x), the expression 
Y = ((X) (4.1) 


defines a new r.v. Y. With y a given number, we denote Dy the subset of Ry (range of X) such that 
g(x) x y. Then 


(Y € y) = [q(X) € y] = (X € Dy) (4.2) 
where (X є Dy) is the event consisting of all outcomes ¢ such that the point X(¢) є Dy. Hence 
Fy(y) = P(Y < у) = P[g(X) < y] = P(X € Dy) (4.3) 


If X is a continuous r.v. with pdf (х), then 


Fly) = | Ix(x) dx (4.4) 
Dy 


B. Determination of f,(y) from f(x): 


Let X be a continuous r.v. with pdf /у(х). If the transformation y = g(x) is one-to-one and has the 
inverse transformation 


х= 9 Чу) = Му) (4.5) 
then the pdf of Ү is given by (Prob. 4.2) 


dh(y) 
dy 
Note that if g(x) is a continuous monotonic increasing or decreasing function, then the transfor- 


mation y = g(x) is one-to-one. If the transformation y = g(x) is not one-to-one, fy) is obtained as 
follows: Denoting the real roots of y = g(x) by x, , that is, 


SAY) = fae) = //‹[Щу)] (4.6) 











dx 
dy 





у= glx) == ху) = (4.7) 
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then fi) = Y T | 


where g'(x) is the derivative of g(x). 


43. FUNCTIONS OF TWO RANDOM VARIABLES 
A. One Function of Two Random Variables: 
Given two r.v.'s X and Y and a function g(x, y), the expression 


Z = g(X, Y) 
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(4.8) 


(4.9) 


defines a new r.v. Z. With z a given number, we denote D; the subset of Ry, [range of (X, Y)] such 


that g(x, y) € z. Then 
(2<2) = [9(Х, Y) < z] = (X. Y) € Dz} 


(4.10) 


where {(X, Y) e Dz} is the event consisting of all outcomes ¢ such that the point {X(Q), Y(Q)} € Dz. 


Hence 
F(z) = Р(2 < 2) = Р[0(Х, Y) xz] = P{(X, Y) e Dz} 


If X and Y are continuous r.v.'s with joint pdf fy, (x, у), then 


ЕД2) = {| Jxx(x, у) dx dy 
Dz 


B. Two Functions of Two Random Variables: 
Given two r.v.’s X and Y and two functions g(x, y) and h(x, y), the expression 


Z =9(X, Y) W =h(X, Y) 


(4.11) 


(4.12) 


(4.13) 


defines two new r.v.'s Z and W. With z and w two given numbers, we denote Dzy the subset of Ry, 


[range of (X, Y)] such that g(x, y) x z and A(x, y) < w. Then 
(Z <z, W <w)=[9(X, Y) < z, (X, Y) < w] = {(X, Y) € Dzw} 


(4.14) 


where ((X, Y) € Dzw} is the event consisting of all outcomes ¢ such that the point {X(Q), Y(Q} є Dzy. 


Hence 
Fzw(z, w) = P(Z < z, W < w) = P[g(X, Y) < 2, AX, Y) < w] 
= P((X, Y) e Dzw} 


In the continuous case, we have 


Fzw(z, w) = | Ју y) dx dy 


Dzw 


Determination of fz yz, w) from fyy(x, y): 
Let X and Y be two continuous г.у with joint pdf у; у(х, у). If the transformation 
z = g(x, y) w = h(x, у) 
is one-to-one and has the inverse transformation 


x — q(z, w) y = r(z, w) 


(4.15) 


(4.16) 


(4.17) 


(4.18) 
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then the joint pdf of Z and W is given by 


faw(z, w) = Јуу(х, y) Jos y)! (4.19) 
where x = q(z, w), y = (т, w), and 
dg 0] |2: ё 
д 
ge y= [OX D| -~ ov (4.20) 


ahah) ~ Jaw д 
ôx ду дх ду 
which is the jacobian of the transformation (4.17). If we define 
дда дд дх дх 
Oz ди д> ду 


Д2, м) = ar ar = ду ay (4.21) 
д2 aw д2 ду 
then | J(z, w)| = | Их, y^! (4.22) 
and Eq. (4.19) can be expressed as 
Губ, w) = fxvba(z, w), riz, w)]| Jiz, и) (4.23) 


44 FUNCTIONS OF п RANDOM VARIABLES 


A. One Function of л Random Variables: 


Given n r.v/s X,,..., X, and a function g(x, ..., x,), the expression 
Y —2g(X,,..., X) (4.24) 
defines a new r.v. Y. Then 

(Y x y) = (Xi ..., X) < y] = [(X s. ..., X) E Dy] (4.25) 
and Faly) = P[g(X,, ..., X) E y] = POX, ..., X,) e Dy] (4.26) 
where Dy is the subset of the range of (X,, ..., X,) such that g(x,, ..., x,) x y. If X,, ..., X, are 

continuous r.v.'s with joint pdf fy, ... х (x, ..., Xp} then 
Ру) = |, e [л caes X) dx, or dx, (4.27) 


B. n Functions of п Random Variables: 


When the joint pdf of nr.v.’s X,,..., X, is given and we want to determine the joint pdf of n r.v.'s 
Y, ..., Y,, where 


Y, = gi(X y, ..., Х„) 


: (4.28) 
Y, = (Х|, ttt X,) 
the approach is the same as for two r.v.’s. We shall assume that the transformation 
Yi = G(X, X) 
: (4.29) 


Yn = IAX 15 зз Xn) 
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is one-to-one and has the inverse transformation 
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x, = ha, 5 Yn) 
' (4.30) 
Xn = һу, бб, Уһ) 
Then the joint pdf of Ү,,..., Ү, is given by 
Ду, ™ vv ttt. yn) = fy Ut х.х, сөө хь)|Д(х |, arty x)! (4.31) 
Ox, Ox, 
where | Cae oo eae 6 3 (4.32) 
0g, ... 909, 
Ox Óx, 
which is the jacobian of the transformation (4.29). 
45 EXPECTATION 
A. Expectation of a Function of One Random Variable: 
The expectation of Y = g(X) is given by 
У g(x)px(x) (discrete case) 
E(Y) = E[g(X)) = (4.33) 
х) fx) dx (continuous case) 
B. Expectation of a Function of More than One Random Variable: 
Let X,,..., X, ben r.v/s, and let Y = g(X,,..., X,). Then 
YY glx gs ..., Xx x e xXx) (discrete case) 
Xi Xn 
E(Y) = E[g(X)) = 
| e { g(xy s Xa) Sxi x Xp o X) dx, + dx, (continuous case) 
(4.34) 
C. Linearity Property of Expectation: 
Note that the expectation operation is linear (Prob. 4.39), and we have 
Д Y ах) = Ya EX) (4.35) 
i=] i=1 
where аг are constants. If r.v.’s X and Y are independent, then we have (Prob. 4.41) 
E[g(X)h(Y)] = E[g(X)]ELACY)] (4.36) 
The relation (4.36) can be generalized to a mutually independent set of n r.v/s X,,..., Х,: 
e| fi axi = П Elg{X)] (4.37) 
і= 1 i=l 
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D. Conditional Expectation as a Random Variable: 


In Sec. 3.8 we defined the conditional expectation of Y given X = x, E(Y | x) [Eq. (3.58)], which is, 
in general, a function of x, say H(x). Now H(X) is a function of the r.v. X ; that is, 


A(X) = E(Y| X) (4.38) 
Thus, E(Y | X) is a function of the r.v. X. Note that E(Y | X) has the following property (Prob. 4.38): 
E[E(Y | X)] = E(Y) (4.39) 


46 MOMENT GENERATING FUNCTIONS 
A. Definition: 
The moment generating function of a r.v. X is defined by 
У e"'px(x) (discrete case) 
M(t) = Ele”) = | (4.40) 


© 
е“/у(х) dx (continuous case) 


— 00 


where t is a real variable. Note that M,(t) may not exist for all r.v.'s X. In general, M x(t) will exist 
only for those values of t for which the sum or integral of Eq. (4.40) converges absolutely. Suppose 
that M x(t) exists. If we express e'* formally and take expectation, then 


M,(t) = E(e*) = ef +tX + -, (Xy + + E (LX + | 


1? tt 
= 1 + E(X) + ЕХ) + ЕХ) +o (4.41) 
and the kth moment of X is given by 
m, = E(X") = M x™(0) k=1,2,... (4.42) 
d* 
where M ,9(0) = jr M y(t) (4.43) 
1=0 


В. Joint Moment Generating Function: 
The joint moment generating function M yy(t,, t2) of two r.v/s X and Y is defined by 
M уу, t2) = E[e!* * 27] (4.44) 


where t, and t; are real variables. Proceeding as we did in Eq. (4.41), we can establish that 





© о t kt п 
Му t) = Ее ery = у у с Ege (4.45) 
and the (k, n) joint moment of X and Ү is given by 
Mın = E(X*Y") = Myy™"(0, 0) (4.46) 
gtt” 
where My, "X0, 0) = —— Мум, t2) (4.47) 
д 110" 11212 = 0 


In a similar fashion, we can define the joint moment generating function of n r.v.'s X,, ..., X, by 


My, xls ss by) = Ep) tte] (448) 
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from which the various moments can be computed. If X,,..., X, are independent, then 
My, xti st) = Ее"? * X9] = (ен... eret) 
= Efe”) --- E(e™*") = My (ti) ++: Mit) (4.49) 


C. Lemmas for Moment Generating Functions: 
Two important lemmas concerning moment generating functions are stated in the following: 


Lemma 4.1: If two rvs have the same moment generating functions, then they must have the same 
distribution. 


Lemma 4.2: Given саѓѕ F(x), F,(x) F(x), ... with corresponding moment generating functions M(t), M (t), 
M,(t),..., then F,(x) > F(x) if M,(t) > M(t). 


47 CHARACTERISTIC FUNCTIONS 
A. Definition: 
The characteristic function of a r.v. X 1s defined by 


У el@*ipy(x)) (discrete case) 
Vio) = Ele) = | (4.50) 


е?®* f(x) dx (continuous case) 
where о is a real variable and j = y —1. Note that Y (œw) is obtained by replacing t in M x(t) by jæ if 
M,(t) exists. Thus, the characteristic function has all the properties of the moment generating func- 
tion. Now 





| do) = | Y е/®ру(х)) ES Y | ё®®р(х)| = Y px(x) = 1 < oo 
for the discrete case and 


IW y(o)| = 


70 





< [` e= f(x) dx | = F f(x) dx = 1< oo 





F ef f(x) dx 


for the continuous case. Thus, the characteristic function ‘Y,(@) is always defined even if the moment 
function M x(t) is not (Prob. 4.58). Note that V (ш) of Eq. (4.50) for the continuous case is the Fourier 
transform (with the sign of j reversed) of f(x). Because of this fact, if V (œw) is known, fy(x) can be 
found from the inverse Fourier transform; that is, 


oo 


1 | 
fx) = on { Ф (ође i do (4.51) 
B. Joint Characteristic Functions: 


The joint characteristic function V y(@),, œ) of two r.v.'s X and Y is defined by 


Ухо}, @2) = E[e/iX* o2] 
Y, Y emite p xi, yy) (discrete case) 
i k 


(4.52) 


| | ејбоіх +29) f(x, у) ах dy (continuous case) 


where о, and œ, are real variables. 
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The expression of Eq. (4.52) for the continuous case is recognized as the two-dimensional Fourier 
transform (with the sign of j reversed) of /уу(х, y). Thus, from the inverse Fourier transform, we have 


Sav y) "os: r [tutos «e Ht do. dws (4.53) 
From Eqs. (4.50) and (4.52), we see that 
P (c) = Ф хуб, 0) P y(o) = P xy(0, о) (4.54) 
which аге called marginal characteristic functions. 
Similarly, we can define the joint characteristic function of n r.v.’s X,,..., X, by 
Чу xg ys coy 0, = E[e m tto] (4.55) 


As in the case of the moment generating function, if X,,..., X, are independent, then 


Vg х (Q9 oss On) = Frw) + Vy(o,) (4.56) 


C. Lemmas for Characteristic Functions: 
Ás with the moment generating function, we have the following two lemmas: 
Lemma 4.3: A distribution function is uniquely determined by its characteristic function. 


Lemma 4.4: Given cdf's F(x), Р(х), F(x), ... with corresponding characteristic functions (o), V (c), Ч о), 
., then F(x) > F(x) at points of continuity of F(x) if and only if V,(c) > Ҹ(о) for every о. 


48 THE LAWS OF LARGE NUMBERS AND THE CENTRAL LIMIT THEOREM 
A. The Weak Law of Large Numbers: 


Let X,,..., X, be a sequence of independent, identically distributed r.v.’s each with a finite mean 
ЕХ) = и. Let 


— 


] n 

OLX =- (X+ + Xy (4.57) 
Then, for any ғ > 0, 

lim P(| X, — u| > в) = 0 (4.58) 


n+ x: 


Equation (4.58) is known as the weak law of large numbers, and X, is known as the sample mean. 


B. The Strong Law of Large Numbers: 


Let X,,..., X, be a sequence of independent, identically distributed r.v.'s each with a finite mean 
E(X,) = џи. Then, for any e > 0, 
P( im iX, - uo е) =0 (4.59) 


where X, is the sample mean defined by Eq. (4.57). Equation (4.59) is known as the strong law of large 
numbers. 
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Notice the important difference between Eqs. (4.58) and (4.59). Equation (4.58) tells us how a 
sequence of probabilities converges, and Eq. (4.59) tells us how the sequence of r.v.’s behaves in the 
limit. The strong law of large numbers tells us that the sequence (X,) is converging to the contant д. 


C. The Central Limit Theorem: 


The central limit theorem is one of the most remarkable results in probability theory. There are 
many versions of this theorem. In its simplest form, the central limit theorem is stated as follows: 

Let X, ..., X, be a sequence of independent, identically distributed r.v.'s each with mean p and 
variance о?. Let 


7 Xt + Х, па Х-и 

" o fn cj fn 

where X, is defined by Eq. (4.57). Then the distribution of Z, tends to the standard normal as n > oo; 
that 1s, 


(4.60) 


lim Z, = N(0; 1) (4.61) 
or 
lim F,,(z) = lim P(Z, < 2) = ®(2) (4.62) 


where @(z) is the cdf of a standard normal r.v. [Eq. (2.54)]. Thus, the central limit theorem tells us 
that for large n, the distribution of the sum S, = X, ++- + X, is approximately normal regardless of 
the form of the distribution of the individual Xs. Notice how much stronger this theorem is than the 
laws of large numbers. In practice, whenever an observed r.v. is known to be a sum of a large number 
of r.v.’s, then the central limit theorem gives us some justification for assuming that this sum is 
normally distributed. 


Solved Problems 


FUNCTIONS OF ONE RANDOM VARIABLE 

41. If X is N(u; a°), then show that Z = (X — p)/¢ is a standard normal r.v.; that is, №0; 1). 
The cdf of Z is 

= и 





Fle) = uz <2) e «sje PX зови 





20 -u 1 
= e Common dy 
E y 2n o 


By the change of variable y = (x — y)/o (that is, x = ту + д), we obtain 


7 1 
кдй = Pz «a | — eo йу 
TE 2л 
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4.2. 


43. 
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аЕ д2) 1 -an 


dz Jn 


and 742) = 








which indicates that Z = N(0; 1). 


Verify Eq. (4.6). 


Assume that y = g(x) is a continuous monotonically increasing function [Fig. 4-1(a)]. Since y = g(x) is 
monotonically increasing, it has an inverse that we denote by x = g^ !(y) = h(y). Then 


Fy(y) = P(Y < y = P[X < hy)] = РГА) (4.63) 
d d 
and ho) = dy Fy(y) = dy UFxERQOM] 
Applying the chain rule of differentiation to this expression yields 


d 
Лу) = AAO] dy h(y) 


which can be written as 


d 
EOE xc ALY) (4.64) 
y 
If y = g(x) is monotonically decreasing [Fig. 4.1(b)], then 
Еу(у) = P(Y x y) = P[X > h(y)] = 1 — Е.А) (4.65) 
d d 
Thus, М) =; РАЙ) = A x= hy) (4.66) 


In Eq. (4.66), since y = g(x) is monotonically decreasing, dy/dx (and dx/dy) is negative. Combining Eqs. 
(4.64) and (4.66), we obtain 
dh(y) 


Sly) = f(x) dy 











d 
2 | = fefhty)] 


which is valid for any continuous monotonic (increasing or decreasing) function у = g(x). 


Let X be a r.v. with cdf Fy(x) and pdf fy(x). Let Y = aX + b, where a and b are real constants 
and a = 0. 


(a) Find the cdf of Y in terms of F y(x). 


Y E 





(a) (b) 
Fig. 4-1 
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(a) (b) 
Fig. 4-2 


(b) Find the pdf of Y in terms of f(x). 
(a) а> 0, then [Fig. 4-2(a)] 


Fy) = PY < y) = PaX + b < y) = r(x < — = FA? = *) (4.67) 





If a < 0, then (Fig. 4-2(b)] 
Еу) = P(Y < y) = Pax +b x y) = F(aX < y — b) 


- Hx E ut (since a « 0, note the change 
a in the inequality sign) 











-1- #,(? = *) + of x = =” (4-68) 


Note that if X is continuous, then Р[Х = (y — b)/a] = 0, and 
—b 
FY) -1— r=) a<0 (4.69) 


(b) From Fig. 4-2, we see that y = g(x) = ax + b is a continuous monotonically increasing (a > 0) or 
decreasing (а < 0) function. Its inverse is x = g~'(y) = Ау) = (y — b)/a, and dx/dy = 1/а. Thus, by Eq. 
(4.6), 


—b 
ho = L ({#—”) (4.70) 


Note that Eq. (4.70) can also be obtained by differentiating Eqs. (4.67) and (4.69) with respect to у. 


44. Let Y = aX + b. Determine the pdf of Y, if X is a uniform г.у. over (0, 1). 
The pdf of X is [Eq. (2.44)] 

О<х<1 

otherwise 


1e |, 


Then by Eq. (4.70), we get 


y-b yeR, 


1 
] —— 
ЛУ) = Tal s (155) = {| а (4.71) 


otherwise 
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4.6. 
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T—« у=ах+Ь 





a»0 а<0 


Fig. 4-3 


The range Ry is found as follows: From Fig. 4-3, we see that 
For a> 0: Ry={y:b<y<a+b} 
For a « 0; Ry={y:ath<y<b} 


Let Y = aX + b. Show that if X = М(и; o?), then Y = N(au + b; а20?), and find the values of a 
and b so that Y = N(0; 1). 


Since X = N(u; o°), by Eq. (2.52), 





Hence, by Eq. (4.70), 





y-b ? 
Sly) = opf- s ) — «| | 
J miae 
opf- zz xag U- iu np] (4.72) 
ү JAja|e 
which is the pdf of М(ад + b; а202). Hence, Y = "i (ар + b; a?o?). Next, let au + b = 0 and a?o? = 1, from 
which we get a = 1/o and b = —и/о. Thus, Y = — yu) e is N(0; 1) (see Prob. 4.1). 


Let X be a rv. with pdf (х). Let Y = X?. Find the pdf of Y. 


The event A — (Y x y) in R, is equivalent to the event B -(- y «Xx Jy) in Ку (Fig. 4-4). If 
y € 0, then 


and f(y) = 0. If y > 0, then 


Fy(y) = PY < у) = Р(—,/у € X < fy) = FX /9) — Fl — J) (4.73) 





d d d 1 
and fin = т; Fn = т; Fal 7. РКМ) = 5 үг Ul) + /х\— JV] 
1 
EUV А-У  y»o 
Thus, Лу) = 2/y xv vy y? (4.74) 


0 у<0 
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Fig. 4-4 


Alternative Solution: 
If y < 0, then the equation y = x? has no real solutions; hence f,(y) = 0. If y > 0, then y = x? has two 
solutions, x, — Jy and x, = - Jy. Now, y = g(x) = x? and g'(x) = 2x. Hence, by Eq. (4.8), 


1 А 
— t 9» f - 90 0 
fi) = 42/9 AIF HIV у> 


0 у<0 
47. Let Y = X?. Find the pdf of Y if X = М(0; 1). 
Since X = N(0; !) 


fo = = е2 


Since f,(x) is an even function, by Eq. (4.74), we obtain 


1 1 
— fe» = еэ? 0 
fo = Ay у> 
0 у<0 





(4.75) 


48. Let Y = X?. Find and sketch the pdf of Y if X is a uniform r.v. over (— 1, 2). 
The pdf of X is (Eq. (2.44)] (Fig. 4-5(a)] 


~l<x<2 


1 
Ax) = К 


otherwise 


In this case, the range of Y is (0, 4), and we must be careful in applying Eq. (4.74). When 0 « y < 1, both 
Jy and — Sy аге in Ry = (— 1, 2), and by Eq. (4.74), 


me (11) 1. 
Y 2 Jy 3*3 з/у 


When 1 < y < 4, /y is in Ry = (—1, 2) but —\/y < — 1, and by Eq. (4.74), 


1 1 1 
= |01 
An 2 /y E t ) 6 / y 


1 
— 0О<у<1 
з/у 
Thus, ЛО» = \_! l<y<4 
6 y 
0 otherwise 


which is sketched in Fig. 4-S(b). 
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fon 


AQ 





(a) (b) 
Fig. 4-5 


49. Let Y = e*. Find the pdf of Y if X is a uniform r.v. over (0, 1). 
The pdf of X is 


мә={, 0<x<1 


0 otherwise 
The cdf of Y is 
Fy) = P(Y < y) = P(e* < у) = P(X x Iny) 


In y In y 
-[ p dx =Iny l<y<e 
: 0 


4 4 1 
Thus, Лу) = — Foy) = — у = – l<y<e 4.76 
00) =F X 4, 7j y (4.76) 
Alternative Solution: 


The function y = 9(х) = e* is a continuous monotonically increasing function. Its inverse is 
x = 97 Ҷу) = Му) = In y. Thus, by Eq. (4.6), we obtain 








1 
. а 1 - O«Iny«li 
Лу) = Ља y) d Iny | = = fxn у) = 3» 
d у 0 otherwise 
E l<y<e 
or Sly) = яу 
0 otherwise 


4.10. Let Y = ех. Find the pdf of Y if X = №и; 02). 
The pdf of X is [Eq. (2.52)] 





1 1 
Be) = „Оло e 2o? œ- | 72559 


Thus, using the technique shown in the alternative solution of Prob. 4.9, we obtain 





1 1 1 
Sly) = y fx(n y) = n? exp| - 553 (In y – | 0 € y « o (4.77) 


Note that X — In Y is the normal r.v.; hence, the r.v. Y is called the log-normal r.v. 
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4.1. 


4.12. 


4.13. 


Let Y = tan X. Find the pdf of Y if X is a uniform r.v. over (— 2/2, n/2). 


The cdf of X is [Eq. (2.45)] 


0 x € —n[2 
F(x) = T —n/2«x«mj2 

1 x > п/2 

Now Fy(y) = P(Y < y) = P(tan X x y) = P(X «tan! y) 
= Pytany) =t (ant + 2) e E anty —o«y«o 
Then the pdf of Y is given by 
d 
Ло) = FF) = S —-0 <y < о 


Note that the r.v. Y is a Cauchy r.v. with parameter 1. 


Let X be a continuous r.v. with the cdf F(x). Let Y = F(X). Show that Y is a uniform r.v. over 
(0, 1). 


Notice from the properties of a cdf that y = F(x) is a monotonically nondecreasing function. Since 
0 x F,(x) < 1 for all real x, y takes on values only on the interval (0, 1). Using Eq. (4.64) (Prob. 4.2), we 
have 


1 
=f) Be 200 - 


аР fy! О<у<1 


1 
Ду) = fx) dylx 


Hence, Y is a uniform r.v. over (0, 1). 


Let Y be a uniform r.v. over (0, 1). Let F(x) be a function which has the properties of the cdf of a 
continuous r.v. with F(a) = 0, F(b) = 1, and F(x) strictly increasing for a < x < b, where a and b 
could be — со and оо, respectively. Let X = F^! (Y). Show that the cdf of X is F(x). 


Fy(x) = Р(Х < x)= P[F (Y) < x] 
Since F(x) is strictly increasing, F~'(Y) x x is equivalent to Y < F(x), and hence 
Fy(x) = P(X < x)= PLY x F(x)) 
Now Y is a uniform r.v. over (0, 1), and by Eq. (2.45), 
Еу) = Р(У<у)= у Oxy«l 
and accordingly, 
Fy(x) = P(X < x) = PLY < F(x)] = F(x) 0 < F(x) «1 


Note that this problem is the converse of Prob. 4.12. 
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4.14. Let X be a continuous r.v. with the pdf 


7х х> 0 
х<0 


А) = M 


Find the transformation Y = g(X) such that the pdf of Y is 


1 
TE 0 1 
fo m 077 


0 otherwise 
The cdf of X is 


[esa a х> 0 
o = 


Fy(x) -f Л de = 0 x<0 
-% 0 


Then from the result of Prob. 4.12, the r.v. Z = 1 — e^* is uniformly distributed over (0, 1). Similarly, the 
cdf of Y is 


Y | 
—— dg 0 
Еу) = [55 -{? <у<1 


0 0 otherwise 


and the r.v. W = ./Y is uniformly distributed over (0, 1). Thus, by setting Z = W, the required transfor- 
mation is Y = (1 — e *y". 


FUNCTIONS OF TWO RANDOM VARIABLES 


415. Consider Z = X + Y. Show that if X and Y are independent Poisson r.v.’s with parameters A, 
and åz, respectively, then Z is also a Poisson r.v. with parameter 4, + À;. 


We can write the event 


X 2Y2nesQ(X-iY-n-iü 


i-0 


where events (X =i, Y = n— i, i = 0, 1, ..., n, are disjoint. Since X and Y are independent, by Eqs. (1.46) 
and (2.40), we have 


PZ = п) = P(X + Y = п) = y PX =i, Y=n—)= Y PX -)P(Y =n- i) 











i-0 i-0 
n i n-i n ijn-i 
= Уе Ay -a 2 2g tàn Ai Ay 
i-o i! (n— i io (n — i)! 
-(41+А2) п t 
€ n: À igi 
= 1 т yp ^1^2 
п! j=0 H(n— i)! 
eg Ot» 
= л 
=—— hi ÀJ 


which indicates that Z = X + Y is a Poisson r.v. with 4, + 45. 


416. Consider two r.v.’s X and Y with joint pdf fxy(x, y. Let Z = X + Y. 


(a) Determine the pdf of Z. 
(b) Determine the pdf of Z if X and Y are independent. 
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БЯ 


P 


Fig. 4-6 


"4 


(a) The range Rz of Z corresponding to the event (Z < 2) = (X + Y < 2) is the set of points (x, y) which lie 
on and to the left of the line z = x + y (Fig. 4-6). Thus, we have 


ЕД2) = Р(Х + Ү < 2) = F INE y) a| dx (4.78) 
d æ Га [e 
Then Sez) = = Едш) = E Е E Sx y) | ах 
= | лба) dx (4.79) 


- о 


(b) ЕХ and Y are independent, then Eq. (4.79) reduces to 
72) = | Sl) fz — x) dx (4.80a) 


The integral on the right-hand side of Eq. (4.80a) is known as a convolution of f,(z) and fy(z). Since the 
convolution is commutative, Eq. (4.80a) can also be written as 


мә= |" foie n (4.800) 


4.17. Using Eqs. (4.19) and (3.30), redo Prob. 4.16(a); that is, find the pdf of Z = X + Y. 


Let Z=X+ Y and W =X. The transformation z = x + у, w = x has the inverse transformation 
x=w,y =z – м, апа 


az ûz 

дх ду 1 | 
J0 7 оь aw -|i jn 

дх ду 


By Eq. (4.19), we obtain 
Југ, №) = fyw, z — w) 
Hence, by Eq. (3.30), we get 


faa = [` ш, w) dw = [` for, z — w) dw = | fd, z — x) dx 


4.48. Suppose that X and Y are independent standard normal r.v.’s. Find the pdf of Z = X + Y. 
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The pdfs of X and Y are 





1 ; 1 
ful) =e? Му-—=е?" 


\/2х \/2х 
Then, by Eq. (4.80а), we have 


e 1 





Já) = [Ace -x)dx- FL Js e n A g G7 5202 dy 


1 wm 


= -— eg Ccx* 2x22 dx 
2n J-a 


Now, 22 — 2zx + 2x? = (2x ~ 1/2) + 22/2, and we have 


1 —234 І ° - (V2 x- z//2)2/2 
e е“ dx 


1 1 


1 
= ~22jq ____ 
Jm Alem 


with the change of variables u = „2х — 2/./2. Since the integrand is the pdf of №0; 1), the integral is 
equal to unity, and we get 





faz) = 


е7"? du 








-aMA _ -221204/2)2 


1 1 
HOT се" "суут; 


which is the pdf of N(0; 2). Thus, Z is a normal r.v. with zero mean and variance 2. 


4.19. Let X and Y be independent uniform r.v.’s over (0, 1). Find and sketch the pdf of Z = X + Y. 


Since X and Y are independent, we have 


1 O0<x<1O0<y<l 
0 otherwise 


fus у) = Л) ЛО) = | 
The range of Z is (0, 2), and 


F,(z)= P(X + Y <z)= {| fxy(x. y) dx dy = [|| 4х ду 


x+ysz x+ysz 


If 0 « z « 1 [Fig. 4-7(a)], 


2 
Fz) = [|| dx dy = shaded area =5 





х+у<2 
а 
апа Л) = = Fz) = 2 
dz 
If 1 <z < 2 [Fig. 4-7(b)], 
a2 
F(z) = {| dx dy = shaded area = | — @ 7 2 
х+у<2 
а 
апа fz(z) = — ЕД2) = 2 – 2 
dz 
2 0<2<1 
Непсе, Г) = 2—2 1 <2<2 


0 otherwise 
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(b) 
ГА) 


which is sketched in Fig. 4-7(c). Note that the same result can be obtained by the convolution of f(z) and 


Лб). 


4.20. Let X and Y be independent gamma r.v.'s with respective parameters (a, А) and (f, 4). Show that 
Z = X + Y is also a gamma г.у. with parameters (a + f, A). 


From Eq. (2.76) (Prob. 2.24), 


Ae (ху! 
fien = — T ^ (а) х > 0 
0 х<0 
лег (дурт! 
———=——— 0 
A= T у> 
0 у<0 


The range of Z is (0, со), and using Eq. (4.80a), we have 











42) = Fan | Ae "xy" He Mae — y^! dx 

0 

да+ z 

“OA | хе dx 

0 

By the change of variable w = x/z, we have 
atf 1 
= -Àz,atf-1 -i -1 
ЈА2) Tore) e "z f ме M1 — wo?) dw 


= ko 5®;%%Ё-\ 
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4.21. 


4.22. 
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where k is a constant which does not depend on z. The value of k is determined as follows: Using Eq. (2.22) 


and definition (2.77) of the gamma function, we have 


| fz) dz = К | е7227%*071 042 
- x о 





k ao 
AT Jo 
ENT Г + B) = 1 
Hence, k = 4**"/T(« + B) and 
2 Го + f) Г(а + f) 


which indicates that Z is a ратта г.у. with parameters (х + f, 4). 


Consider two r.v.’s X and Y with joint pdf fyy(x, y). Determine the pdf of Z = XY. 


2> 0 


Let Z = XY and W = X. The transformation z = ху, w = x has the inverse transformation x = w, 


y = z/w, and 


ôx Ox 

a x 0 1 

д2 дм 1 
Jie, w) = ду ду "|l zy 

i: Ow w w? 


Thus, by Eq. (4.23), we obtain 





fav 2) 
w 
fan 3 dw 
Ww 


1 
fzw(z. w) = | = 
w 


and the marginal pdf of Z is 


1 
w 








m-f 


Let X and Y be independent uniform r.v.’s over (0, 1). Find the pdf of Z = XY. 


We have 


1 0<x<l,0<y<l 
0 otherwise 


Sails, y) = { 
The range of Z is (0, 1). Then 


fu =) 1 O<w<l,0<2z/w<l 
w,—]J-— А 
xY w 0 otherwise 


1 O<z<we<l 


or f. (> 2 - | 
mw) ]0 otherwise 
By Eq. (4.82), 


1 
1 
мә | би = —1п2 0<2<1 


(4.81) 


(4.82) 
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4.23. 


4.24, 


4.25. 
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—Inz О<2<1 


Thus, . 
us 0 otherwise 


72) = | 


Consider two r.v.'s X and Y with joint pdf fyy(x, y). Determine the pdf of Z = X/Y. 


Let Z = X/Y and W = Y. The transformation z = x/y, w = y has the inverse transformation x = zw, 
y = w,and 








Ox 2x 
_ д: ду w z 
Л) = |, оу [ [0 1|7" 
az ôw 


Thus, by Eq. (4.23), we obtain 


Гб, w) = Jw fyylzw, w) (4.83) 


and the marginal pdf of Z is 


Jaz) = | — wl Леом, w) dw (4.84) 


Let X and Y be independent standard normal r.v.’s. Find the pdf of Z = X/Y. 


Since X and Y are independent, using Eq. (4.84), we have 


1 
iw] __ өт "201 + 22/2 dw 


JA) = [` |w] fzw)fy(w) dw = F 


-a 


| | f° 
= — we "! +2232 dw — — we wd +2)/2 dw 
2n Jo 2n Joa 


1 
-—— —À X = 00 <2 < 00 
n(l + z^) 


which is the pdf of a Cauchy r.v. with parameter 1. 


Let X and Y be two r.v.'s with joint pdf fyy(x, y) and joint cdf Рух, y). Let Z = max(X, Y). 


(a) Find the cdf of Z. 
(b) Find the pdf of Z if X and Y are independent. 


(a) The region in the xy plane corresponding to the event (max(X, Y) < z} is shown as the shaded area in 


Fig. 4-8. Then 
ЕД2) = Р(2 < 2) = P(X < z, Y <2) = Fyy(z, 2) (4.85) 
(b) 1f X and Y are independent, then 
ЕД2) = Fy(z)Fy(z) 
and differentiating with respect to z gives 
faz) = (Р lz) + F ylz) fyl2) (4.86) 
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БЯ 


um 
p 


“Fig. 4-8 


x 


I. 


4.26. Let X and Y be two r.v.’s with joint pdf fy y(x, y) and joint cdf Fyy(x, у). Let W = min(X, Y). 
(a) Find the cdf of W. 
(b) Find the pdf of W if X and Y are independent. 


(a) The region in the xy plane corresponding to the event (min(X, Y) x w} is shown as the shaded area in 
Fig. 4-9. Then 


P(W x м) = РКХ <w) o (Y x w) 
= P(X < w) + P(Y <и) – Р(Х < w) ao (Y <w)} 


Thus, Fy(w) = Fy(w) + Fy(w) — Fyy(w, w) (4.87) 
(b) If X and Y are independent, then 
Fy(w) = Fy(w) + Fy(w) — Fy(w)Fy(w) 
and differentiating with respect to w gives 


Sul) = Ли) + fy(w) — fx(w)Fy(w) — Fy(w)fy(w) 
= (и) — Fy(w)] + ЛЮ) — Fx(w)] (4.88) 





427. Let X and Y be two r.v.’s with joint pdf fyy(x, y). Let 


= /X*+ Y? Ө = tan"! Е (4.89) 


Find feo(r, 0) in terms of fyy(x, y). 
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We assume that r > 0 and 0 < 0 < 2л. With this assumption, the transformation 


xi. yr tan! "59 
x 
has the inverse transformation 
x=rcos 0 y=rsin 6 
Since 
ôx Ox 


ôr 20 cos@ —rsin 6 

= =r 
oy ду sin 0 r cos 0 
or 00 


Дх, y) = 


by Eq. (4.23) we obtain 
Srolt. 9) = у cos б, r sin 6) 


4.28. A voltage V is a function of time t and is given by 


V(t) = X cos ot + Y sin wt 
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(4.90) 


(4.91) 


in which о is a constant angular frequency and X = Y = N(0; c?) and they are independent. 


(a) Show that V(t) may be written as 
V(t) = R cos (ct — Ө) 
(b) Find the pdf's of r.v.'s R and O and show that R and O are independent. 


(a) We have 
V(t) = X cos wt + Y sin «t 


= ХЕ Yi = ов в) 
/X? + ү? /X? 4 Y? 

= AJ X? + Y?(cos © cos wt + sin Ө sin ct) 

= R cos(ot — Ө) 


COS wt + 


Y 
where R= X+ Y? and © = (апт! X 


which is the transformation (4.89). 
(b) Since X = Y = N(0; c?) and they are independent, we have 


x = EN = (x2 + y2)/((2a2 
Srl , y) 2 ze (x? +y2)/(2a2) 
5, using Eq. (4.90), we get 


Sralt, 9) = rfyy(r cos 0, r sin 6) = — ;e 70e 
2лт 


2п r -r » 2n r 2н, 
Now far) = f Геі”, 9) 40 = Ino? e "Qn [ 40 = z 677210292) 


o 


> 1 
re 02) dr = — 
o 2л 





/е(0) = [ Jrolr, 8) dr = 
o 


2ла? 


and fre(r, 6) = falr) fo(0); hence, R and © are independent. 
Note that R is a Rayleigh r.v. (Prob. 2.23), and © is a uniform r.v. over (0, 2л). 


(4.92) 


(4.93) 


(4.94) 


(4.95) 
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4.29. Let X, Y, and Z be independent standard normal r.v.’s. Let W = (X? + Y? + Z?)!?, Find the 
pdf of W. 


We have 


1 2+у?+>:2 
far y. 2) = ООЛО) = Ол? e wt 


and Fy(w) = PIW < w) = Р(Х? + Y? + 22 < м2) 


-fa 27) 771372 © есе» +з? dx dy dz 
л) 


where Ry = {(х, у, 2): x? + y? + z? < w?), Using spherical coordinates (Fig. 4-10), we have 
х + у +22 = р? 
dx dy dz =r? sin 0 dr 40 do 





and Fy(w) = ORA wb [ E 700202 sin 0 dr 40 аф 
a" ml io |" sin oao |" 700202 q 
п 
— (лу2) Г гел”? dr (4.96) 
(2n)? o 
Thus, the pdf of W is 
2 
d fees w»60 
fy) =-— Рин) = {уя (4.97) 
0 w <0 





Fig. 4-10 Spherical coordinates. 


4.30. Let X,,..., X, be n independent r.v.’s each with the identical pdf f(x). Let Z = max(X,, ..., Xn). 
Find the pdf of Z. 


The probability P(z « Z « z + dz) is equal to the probability that one of the r.v.'s falls in (z, z + dz) 
and all others are less than z. The probability that one of Х, (i = l, ..., n) falls in (z, z + dz) and all others 
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4.31. 


4.32. 


are all less than z is 


S (2) ae( |" f(x) ej | 


Since there аге n ways of choosing the variables to be maximum, we have 


z avi 
faz) = vex T fix) ax) = GF ' (4.98) 


When n — 2, Eq. (4.98) reduces to 


J2) = 2f (z) { Six) dx = 2f(z)F(z) (4.99) 


o 


which is the same as Eq. (4.86) (Prob. 4.25) with /у(2) = fy(z) = f(z) and F(z) = F(z) = F(z). 


Let X,,..., X, be n independent r.v.'s each with the identical pdf f(x). Let W = min(X,, ..., X,). 
Find the pdf of W. 


The probability P(w < W < w + dw) is equal to the probability that one of the г.у. falls in (w, w + dw) 
and all others are greater than w. The probability that one of X; (i = 1, ..., n) falls in (w, w + dw) and all 


others are greater than w is 
x n-i 
f(w) e] f(x) ax) 


Since there are n ways of choosing the variables to be minimum, we have 


20 п- 
ію) = won | f(x) ax) = nfi1- F(z)" (4.100) 


w 


When n = 2, Eq. (4.100) reduces to 


өз 


Jw(w) = 2f(w) { f(x) dx = Y (w)[1 — F(w)] (4.101) 


which is the same as Eq. (4.88) (Prob. 4.26) with f,(w) = fy(w) = f(w) and Fy(w) = Fy(w) = F(w). 


Let X;, i — 1, ..., n, be n independent gamma r.v.’s with respective parameters (a;, 4), i = 1, ..., 
n. Let 


Ү=Х,++Х„=) X, 


Р= 1 
Show that Y is also а gamma r.v. with parameters (Уу a;, 4). 


We prove this proposition by induction. Let us assume that the proposition is true for n = k; that is, 


k 
Z=X vc Ху = Y X, 
i=] 


k 
is a gamma r.v. with parameters (f, 4) 2 (Y. о, 4). 
і= 1 


+1 
Let W=Z+X,.,= Y, X, 
i=) 


Then, by the result of Prob. 4.20, we see that W is a gamma r.v. with parameters (В + œp 4) = 
(У #11 ai, А). Hence, the proposition is true for n = k + 1. Next, by the result of Prob. 4.20, the proposition 


is true for n = 2. Thus, we conclude that the proposition is true for any n > 2. 
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Let X,,..., X, be n independent exponential r.v.’s each with parameter 4. Let 
Y=X t: +Х,= Y X, 
i= 
Show that Y is a gamma r.v. with parameters (n, 4). 


We note that an exponential r.v. with parameter 4 is a gamma r.v. with parameters (1, A) (Prob. 2.24). 
Thus, from the result of Prob. 4.32 and setting a; = 1, we conclude that Y is a gamma r.v. with parameters 
(n, A). 


Let Zi, ..., Z, ben independent standard normal r.v.’s. Let 
Y-Z?) + -+Z =} Z? 
Find the pdf of Y. 
Let Y, = Z;?. Then by Eq. (4.75) (Prob. 4.7), the pdf of Y, is 
1 e v2 


0 
Suv) = {лу у> 


0 у<0 





Now, using Eq. (2.80), we can rewrite 


dyn _ Зе Py gerant 
е = te = 1 


ny Jn i г) 
and we recognize the above as the pdf of a gamma г.у. with parameters (4, +) [Eq. (2.76)]. Thus, by the 
result of Prob. 4.32, we conclude that Y is the gamma r.v. with parameters (n/2, +) and 


le "xy! _ езү}! > 0 
K= гуз  —£?rw» ^ (4.102) 
0 у<0 
When n is an even integer, Г(л/2) = [(n/2) — 1]!, whereas when и is odd, T(n/2) can be obtained from 
T(x) = (a — 1)Г(« — 1) [Eq. (2.78)] and r(4) = Jn [Eq. (2.80)]. 

Note that Equation (4.102) is referred to as the chi-square (y^) density function with n degrees of 
freedom, and Y is known as the chi-square (у?) r.v. with n degrees of freedom. It is important to recognize 
that the sum of the squares of n independent standard normal r.v.'s is a chi-square r.v. with n degrees of 
freedom. The chi-square distribution plays an important role in statistical analysis. 


Let X,, X,, and X, be independent standard normal r.v.'s. Let 


Y=X,+X,4+ X3 


Ү -X,-X,; 
ъ= Х – X; 
Determine the joint pdf of Ү,, Ү,, and Y}. 
Let Vy =X, + X; Xj 
У =X, — XQ (4.103) 


Y3 5 X3 — X3 
By Eq. (4.32), the jacobian of transformation (4.103) is 
1 1 1 


Дх œ x2; x3) =|1 -1 0|23 
0 1 =! 
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Thus, solving the system (4.103), we get 
x, = iy, + 2у,+ ys) 
х= (у= Yo + ys) 
ху = iyi — ya — 2у3) 
Then by Eq. (4.31), we obtain 
yit 2у›+ Ya Yi — Ya t Y Yi Y —2 
URAANI Yo, уз) = aan (BAB, ta, — — 


Since X,, X2, and X, are independent, 


3 
і - 
Jaial Xo X25 Хз) = П fx(x) = ED e (x12 + x22 + хэ1у2 
Hence f ( )= 1 оао 
, YiY2Ys Jp Ya) Уз) = 30a е 
+ 2у› + у;\? — 2 -y,- 2 
where q(y,, yn yy = pL 4 (=) + [ee] 


= iy? + i» + уз? + iiy 


EXPECTATION 

4.36. Let X be a uniform r.v. over (0, 1) and Y = ех. 
(а) Find E(Y) by using fY(y). 
(b) Find E(Y) by using fy(x). 
(a) From Eq. (4.76) (Prob. 4.9), 


1 

- 1 <у<е 
SAY) = Sy 

0 otherwise 

Hence, E(Y) = f yf dy = f dy=e-1 
— 0 1 
(b) The pdf of X is 

1 O<x<1 

Бо) = » otherwise 


Then, by Eq. (4.33), 


E: 1 
an- | efi deo (ее 
w О 


437. Let Y — aX + b, where a and b are constants. Show that 
(a) E(Y) = E(aX + b) = aE(X) - b 
(b) Var(Y) = Var(aX + b) = a? Var(X) 
We verify for the continuous case. The proof for the discrete case is similar. 


(а) Ву Eq. (4.33), 


E(Y) = E(aX + b) = | (ax + b) fy(x) dx 


— 0o 


=a F xfy(x) dx + b Г (х) dx = aE(X) + b 


-% 
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(4.104) 


(4.105) 
(4.106) 
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(b) Using Eq. (4.105), we have 


Var(Y) = Var(aX + b) = E((aX + b — [aE(X) + by] 
= Efa? [X — E(X))) = a E((X — B(X)P} = a? Var(X) 


4.38. Verify Eq. (4.39). 


Using Eqs. (3.58) and (3.38), we have 
E[E(Y] Xy] = | E(Y | х)/у(х) dx = | ll Malo ay |j dx 


| P r AED ow as i- FAT fois ух] 
=w ж 0 Ло) = © = 


| vfi) dy = ELY] 


4.39. Let Z = aX + bY, where a and b are constants. Show that 


4.40. 


E(Z) = E(aX + bY) = aE(X) + bE(Y) 


We verify for the continuous case. The proof for the discrete case is similar. 


a 


E(Z) = E(aX + bY) = | Г (ax + Бу) fyy(x. y) dx dy 


-=a -ax 


=a | | xfyy(x, y) dx dy + Б | | Уух, y) dx dy 


=a | || Јуу(х, y) a| dx +b | | || fx. у) ax dy 


=a | Х/у(х) dx +b | | yfy(y) dy = aE(X) + ЬЕ(Ү) 


Note that Eq. (4.107) (the linearity of E) can be easily extended to n r.v.'s: 


(а х) = Ya EX) 
=i i=l 


Let Y = aX 4 b. 
(a) Find the covariance of X and Y. 


(b) Find the correlation coefficient of X and Y. 


(а) By Eq. (4.107), we have 
E(X Y) = E[X(aX + Бу] = aE(X?) + bE(X) 
E(Y) = E(aX + b) = aE(X) +b 
Thus, the covariance of X and Y is [Eq. (3.57)] 
Cov(X, Y) = суу = E(XY) — E(X)E(Y) 

= aE(X?) + bE(X) — E(X)[aE(X) + b] 

= alE(X?) — [E(X)]*} = aos? 
(b) By Eq. (4.106), we have o, = |а| осу. Thus, the correlation coefficient of X and Y is [Eq. (3.53)] 








Oxy асу? а 1 а> 0 
Pxy 


оуоу оу|а|оу lal —1 а<0 


(4.107) 


(4.108) 


(4.109) 


(4.110) 
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441. Verify Eq. (4.36). 


Since X and Y are independent, we have 
E[g(X)k(Y)] = | | (х) у(х, y) dx dy 
- | | | IORS) dx dy 


-[ gix)fx(x) ax f hy) fry) dy 


= е - 


= E[g(X)] ELA Y)] 


The proof for the discrete case is similar. 


442. Let X and Y be defined by 


X = со$ Ө Y =ш © 


where Ө is a random variable uniformly distributed over (0, 27). 


(а) 
(b) 


(a) 


(b) 


Show that X and Y are uncorrelated. 


Show that X and Y are not independent. 
We have 
0«0«2 
V л 
fe(8) = $ 2x 
0 otherwise 
e 2m 1 2n 
Then E(X) = | xfx(x) dx = | cos 6 /9(0) 40 = Sn | cos 0 40 = 0 
-x 0 o 
1 2n 
Similarly, E(Y)-— | sin 0 40 20 
2n Jo 
1 2т 1 2n 
sx e x. | cos @ sin 0d - | sin 20 dð = 0 = E(X)E(Y) 
2n Jo 4n Jo 
Thus, by Eq. (3.52), X and Y are uncorrelated. 
2 1 2n 1 j 2n 1 
Е(Х?) = — cos“ 0 40 = — (1 + cos 20) d@ =- 
2n Jo 4n Jo 2 
E(Y?) = и sin? 0 46 = l "1 — cos 20) 40 = 1 
~ 2n Jo ~ 4n Jo (2 


1 2n t 2n 1 
E(X? Y?) = — cos? Ө sin? 8 d8 c- | (1 — cos 40) 40 =- 
2n Jo 16x Jo 8 
Hence 
Е(Х?Ү?) = $ #4 = Е(Х?)Е(Ү?) 
If X and Y were independent, then by Eq. (4.36), we would have E(X?Y?) = Е(Х?)Е(Ү?). Therefore, X 
and Y are not independent. 


443. Let X,,..., X, ben r.v.s. Show that 


va(Xax)- Y Уша, Cov(X;, X) (4111) 


i=} i21 ј=1 
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If X,,..., X, are pairwise independent, then 
үа Ya х) = Уа? Var(X) (4.112) 
i=. i= 
Let Y= Y aX, 


i=l 


Then by Eq. (4.108), we have 


Var(Y) = E{LY — EY?) = gs Y a[x,- xin) | 


= Ыр: i аах, — E(X)J[X; — Eo) 
= X Ý аах, - BIE, - EX 
- X Xa a, Cov(X,, X) 
If X,,..., X, are pairwise independent, then (Prob. 3.22) 
Cov(X,, X) = MD i=j 
izj 


and Eq. (4.111) reduces to 


vaf $ а; х) = X а? Var(X;) 


MOMENT GENERATING FUNCTIONS 
4.44. Let the moment of a discrete r.v. X be given by 
E(X*) = 0.8 к= 1, 2, ... 
(а) Find the moment generating function of Х. 
(b Find Р(Х = 0) and P(X = 1). 
(a) By Eq. (4.41), the moment generating function of X is 


2 t* 
My(t) = 1 + (Ху + 2 ЕХ?) +o +, EK + 


* k 


t? 2 
=1+0. = — = ВУ — 
eos +++ -) 1+08) ү 


k! 
© * 
= 0.2 + 0.8 p ee 0.2 + 0.8е' (4.113) 
(b) By definition (4.40), 
Му) = Бе?) = У e"py(x) (4.114) 
i 


Thus, equating Eqs. (4.113) and (4.114), we obtain 
py(0) = P(X = 0) = 02 px(1) = Р(Х = 1) 2 08 
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4.45. 


4.46. 


4.47. 


Let X be a Bernoulli r.v. 
(a) Find the moment generating function of X. 
(b) Find the mean and variance of X. 
(a) By definition (4.40) and Eq. (2.32), 
М0) = Ele") = У e" py(x;) 
= ey (0) + ep (1) = (1 — p) + pe 
(b) By Eq. (4.42), 








E(X) = My) = ре] =p 
t= 0 
E(X?) = Mx(0) = pe'| =p 
120 
Hence, Var(X) = E(X?) — [E(X) = p — p? = p(1 — p) 


Let X be a binomial r.v. with parameters (n, p). 

(a) Find the moment generating function of X. 

(b) Find the mean and variance of X. 

(a) By definition (4.40) and Eq. (2.36), and letting q = 1 — p, we get 
My(t) = Бе) = У erp 


k=0 


(b) The first two derivatives of M x(t) are 
M x(t) = n(q + реу іре 
M xt) = n(q + реу ! pe + n(n — 1 + ре)" (ре) 
Thus, by Eq. (4.42), 
Hy = E(X) = М;(0) = np 
E(X?) = Mx(0) = np + n(n — Up? 
Hence, ву? = E(X?) — [Е(Х)]? = np(1 — p) 


Let X be a Poisson r.v. with parameter A. 

(a) Find the moment generating function of X. 
(b) Find the mean and variance of X. 

(a) By definition (4.40) and Eq. (2.40), 


w 
-ey mg gU age D 


(b) The first two derivatives of M (t) are 
М) = dete D 
M (t) = (àe gt D + Дете 0 
Thus, by Eq. (4.42), 
E(X) = Mi(00 =A E(X?) = М0) =A? +4 
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(4.115) 


(4.116) 


(4.117) 
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Hence, Var(X) = E(X?) -{E(X)P = А2 +04 42 = 4 


4.48. Let X be an exponential г.у. with parameter 4. 


(a) Find the moment generating function of X. 
(b) Find the mean and variance of X. 
(a) By definition (4.40) and Eq. (2.48), 


M(t) = Ele”) = | Де хех dx 


o 








4 Ы 4 
= u — 4)х = —— 
е er A>t (4.118) 
(b) The first two derivatives of M,(t) are 
4 2 
M (t) = —— "(t) = ——— 
Wsyp MO =O 
Thus, by Eq. (4.42), 
1 2 
E(X) = M,(0) — E(X?) = Mx(0) = 5 
› sva 2 IV] 
Hence, Var(X) = E(X^) — [E(X)]^ = Bux 


4.49. Find the moment generating function of the standard normal r.v. X = N(0; 1) and calculate the 
first three moments of X. 
By definition (4.40) and Eq. (2.52), 


o 
-x2 
e* 12 gtx dx 





M(t) = Ее?) = | 


a 2n 


Combining the exponents and completing the square, that is, 








x? " (х= 0)2 c 
-L x= — — 
2 2 2 
we obtain 
„|^ 1] 
M x(t) = el? | Ju g^ 07mm dy = е2 (4.119) 
=% л 


since the integrand is the pdf of N(t; 1). 
Differentiating M x(t) with respect to £ three times, we have 


М0) = te^? M(t) = (12 + De? M(t) = (P + 30e”? 
Thus, by Eq. (4.42), 
E(X) = Mi(0) =0 E(X?) = М0) = 1 E(X?) = M,(0) = 0 


4.50. Let Y — aX +b. Let M,(t) be the moment generating function of X. Show that the moment 
generating function of Y is given by 


M(t) = eM zlat) (4.120) 
By Eqs. (4.40) and (4.105), 
М) = Бе") = Егечх +5] 
= e^E(e?*) = еМ (at) 
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4.51. 


4.52. 


4.53. 


4.54. 


4.55. 


4.56. 


Find the moment generating function of a normal r.v. М(и; 07). 


If X is N(Q; 1), then from Prob. 4.1 (or Prob. 4.37), we see that Y = oX + uis N(u; o°). Then by setting 
a = c and b = pin Eq. (4.120) (Prob. 4.50) and using Eq. (4.119), we get 


M(t) = e" My(ot) = egi? — ен +910272 (4.121) 


Let X,, ..., X, be n independent r.v.'s and let the moment generating function of X, be M y(t). 
Let Y = X, +-+- + X,. Find the moment generating function of Y. 


By definition (4.40), 


My(t) = Бе?) = E[(gé 97 7*9] = Fett o е) 
= E(e'*!) +- E(e'""^) ^ (independence) 
= My (t) My (0) (4.122) 


Show that if X,, ..., X, are independent Bernoulli r.v.'s with the parameter p, then Y = X, + 
X, is a binomial r.v. with the parameters (n, р). 


Using Eqs. (4.122) and (4.115), the moment generating function of Y is 


n 


My = Па + pe) = (q + pe)" q=1-p 


which is the moment generating function of a binomial r.v. with parameters (n, p) (Eq. (4.116)]. Hence, Y is 
a binomial r.v, with parameters (и, р). 


Show that if X, ..., X, are independent Poisson r.v.’s X; having parameter 4;, then Y = X, + 
- + X, is also a Poisson r.v. with parameter A= A, + + 4,. 


Using Eqs. (4.122) and (4.117), the moment generating function of Y is 


n 
M(t) = Il g^ Ө = (®Айе-1) оке 0) 


i-1 
which is the moment generating function of a Poisson r.v. with parameter 4. Hence, Y is a Poisson r.v. with 
parameter А = УД = À, + c A. 
Note that Prob. 4.15 is a special case for n — 2. 


Show that if X,, ..., X, are independent normal r.v/'s and X; = №; о), then Y = X, + 
- + X, is also a normal r.v. with mean и = иу +*+ + p, and variance о? = о +++ +4," 


Using Eqs. (4.122) and (4.121), the moment generating function of Y is 


Mt) = П ешт +00122) —. Quir + (0i2)2/2 _ out o2t2/2 


i=) 


which is the moment generating function of a normal r.v. with mean и and variance o?. Hence, Y is a 
normal r.v. with mean и = p, +++: + u, and variance o? = 4,7 +---+4,7 
Note that Prob. 4.18 is a special case for n = 2 with и = 0 and o? = 


Find the moment generating function of a gamma r.v. Y with parameters (n, 4). 


From Prob. 4.33, we see that if Х,, ..., X, are independent exponential r.v/s, each with parameter 4, 
then Y = X, +++ X, is a gamma г.у. with parameters (n, 4). Thus, by Eqs. (4.122) and (4.118), the 


moment generating function of Y is 
^ À 4 wv 
M(t) = — ] =| —— 4.123 
a П) (4) (4123) 
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CHARACTERISTIC FUNCTIONS 


4.57. The r.v. X can take on the values x, = —1 and x, = +1 with pmfs py(x,) = py(x;) = 0.5. 
Determine the characteristic function of X. 
By definition (4.50), the characteristic function of X is 


V (о) = 0.Se ^)" + 0.5е® = Hel + eJ") = cos w 


4.58. Find the characteristic function of a Cauchy r.v. X with parameter a and pdf given by 


a 


= —* -0 «x < о 
n(x? + a?) 


Sax) 


By direct integration (or from the Table of Fourier transforms in Appendix B), we have the following 
Fourier transform pair: 
2a 
o? + а? 





e 4 e 


Now, by the duality property of the Fourier transform, we have the following Fourier transform pair: 


2a 
x +a 





€» 2ne "l7 9l = Qne lol 
2 


or (by the linearity property of the Fourier transform) 


a e e lel 
n(x? + a?) 
Thus, the characteristic function of X is 
V (a) = e ale! (4.124) 


Note that the moment generating function of the Cauchy r.v. X does not exist, since E(X") > oo for n > 2. 


4.59. The characteristic function of a r.v. X is given by 
1 [0] [0] «1 
Р = 
x) А loj» 1 
Find the pdf of X. 


From formula (4.51), we obtain the pdf of X as 


a0 


= l yw - fox 4 
fx) =у- a x(@)e o 


0 1 
= || (1 + ође io | (1 — wje ех io | 
2n [J-i o 


; ; 1 
(2 — e* е) = — (1 — cos x) 





2nx? nx? 
_ 1 [sin(x/2 P exc 
2л х/2 vee 


4.60. Find the characteristic function of a normal r.v. X = N(u; o°). 


The moment generating function of N(u; a?) is [Eq. (4.121)] 


Муй) = eto 


CHAP. 4] FUNCTIONS OF RANDOM VARIABLES, EXPECTATION, LIMIT THEOREMS 155 


Thus the characteristic function of N(u; o?) is obtained by setting t = jw in M y(t); that is, 


Yw) = еш +220212 = gin 7 9202/2 (4.125) 


12 jo 


4.61. Let Y = aX + b. Show that if Ҹу) is the characteristic function of X, then the characteristic 
function of Y 1s given by 


Ww) = el" (ао) (4.126) 
By definition (4.50), 


Ч (ш) = Ele) = Eleita +5] 
= et Efex) = ei (aw) 


4.62. Using the characteristic equation technique, redo part (b) of Prob. 4.16. 
Let Z = X + Y, where X and Y are independent. Then 
PAo) = Ele) = E(X * 0) = (ебе) Eer) 
= V (oy (о) (4.127) 
Applying the convolution theorem of the Fourier transform (Appendix B), we obtain 


fz) = F~ TYO] = 5 "DP (Qo) о) 
= fy) * fy(z) = f Sx) f — x) dx 


THE LAWS OF LARGE NUMBERS AND THE CENTRAL LIMIT THEOREM 
4.63. Verify the weak law of large numbers (4.58); that is, 
lim P(|X, —u| > £920 гапу є 


л» со 


1 
where X, = - (X, ++: + X,) and Е(Х) = p, Var(X)) = 0°. 


Using Eqs. (4./08) and (4.112), we have 


E(X,)= and Var(X,) = = (4.128) 
Then it follows from Chebyshev’s inequality [Eq. (2.97)] (Prob. 2.36) that 
o c? 
Р(Х, -ul> 95 (4.129) 


Since lim,- o?/(nc^) = 0, we get 


п со 


lim P(| X, — иі > є) = 0 


n- o 


4.64. Let X be a r.v. with pdf f,(x) and let X,, ..., X, be a set of independent r.v.’s each with pdf fy(x). 
Then the set of r.v.’s X,, ..., X, is called a random sample of size n of X. The sample mean is 
defined by 


1 п 
= УХ, (4.130) 


i-1 


1 
Х,= (Xi ++ X.) = 
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Let X,, ..., X, be a random sample of X with mean и and variance о?. How many 
samples of X should be taken if the probability that the sample mean will not deviate from the 
true mean и by more than о/10 is at least 0.95? 


Setting є = 0/10 in Eq. (4.129), we have 


_ c 5 c в? 100 
Р||Х„— і Р | ~~ = — 
( a n>) ( = | < ) mu " 


_ 1 
or њих, а 5) 21-190 
10 п 


Thus if we want this probability to be at least 0.95, we must have 100/n < 0.05 or n > 100/0.05 = 2000. 


Verify the central limit theorem (4.61). 


Let X,..., X, be a sequence of independent, identically distributed r.v's with E(X) =p and 
Var(X;) = o?. Consider the sum S, = X, +--+ + X,. Then by Eqs. (4.108) апа (4.112), we have E(S,) = nu 
and Var(S,) = no?. Let 








z,- om ts (== (4.131) 


Then by Eqs. (4.105) and (4.106), we have E(Z,) = 0 and Var(Z,) = 1. Let M(t) be the moment generating 
function of the standardized r.v. Y, = (X; — u)/c. Since E(Y) = 0 and Е(Ү,?) = Var(Y) = 1, by Eq. (4.42), we 
have 


M(0)=1 M'(0) = E(Y) = 0 М"(0) = EY?) = 1 


Given that M'(t) and M”(t) are continuous functions of t, a Taylor (or Maclaurin) expansion of M(t) about 
t = 0 сап be expressed as 


? ? 
M(t) = M(0) + M'(0) + M") Pri 1+ M%(t,) 3 0ziuzt 
By adding and subtracting г2/2, we have 
M(t) = 1 +422  AEM"(t,) — 1]? (4.132) 
Now, by Eqs. (4.120) and (4.122), the moment generating function of Z, is 


м,() = [n7] (4.133) 


Using Eq. (4.132), Eq. (4.133) can be written as 


maid |1005) eon) | 
240) = + 2 Jn + 2 (t4) — y: 


where now t, is between 0 and t/ n. Since M"(t) is continuous at t = 0 and г, > 0 as n > оо, we have 


lim [M"(t) - 1] = M'(0 121—120 


п оо 


Thus, from elementary calculus, lim, „(1 + х/п)" = е“, and we obtain 


2 1 a 
lim Mz (i) = lim t ID [M"(t) — пе} 
^ 2n 2n 


п о noo 


2 2N^ 
lim ( + 22 = el 


п п 


The right-hand side is the moment generating function of the standard normal г.у. Z = №0; 1) [Eq. 
(4.119)]. Hence, by Lemma 4.2 of the moment generating function, 


lim Z, = N(0; 1) 


no 
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4.66. 


4.67. 


Let X,,..., X, be n independent Cauchy r.v.’s with identical pdf shown in Prob. 4.58. Let 


У X, 
п= 1 


Ү, = 


sie 


LOS b X) 
(a) Find the characteristic function of Y,. 
(b) Find the pdf of Y,. 
(c) Does the central limit theorem hold? 
(a) From Eq. (4.124), the characteristic function of X; is 
V (a) = ei 
Let Y = X, +-+: + Х,. Then the characteristic function of Y is 
Wo) = Ele") = Epei»t t] = TT wa) = eme (4.134) 
i= 


Now Y, = (1/n)Y. Thus, by Eq. (4.126), the characteristic function of Y, is 
e 
Y, (0) = (2) = ет "ео — gall (4.135) 
п 


(b) Equation (4.135) indicates that Y, is also a Cauchy r.v. with parameter a, and its pdf is the same as that 
of X;. 

(c) Since the characteristic function of Y, is independent of n and so is its pdf, Y, does not tend to a normal 
r.v. as n > ос, and so the central limit theorem does not hold in this case. 


Let Y be a binomial r.v. with parameters (n, p). Using the central limit theorem, derive the 
approximation formula 


PY < у) x 4-27) (4.136) 


мп1 — р) 
where Ф(2) is the cdf of a standard normal г.у. [Eq. (2.54)]. 


We saw in Prob. 4.53 that if X,, ..., X, are independent Bernoulli r.v.’s, each with parameter p, then 
Y =X, ++ + Х, is a binomial r.v. with parameters (n, p). Since X;'s are independent, we can apply the 
central limit theorem to the r.v. Z, defined by 


і л ха) 1 (= 
Z, = —= (A) -—- ——— (4.137) 
n 2, Var(X;) n à pl — p) 
Thus, for large n, Z, is normally distributed and 
P(Z, € х) а D(x) (4.138) 


Substituting Eq. (4.137) into Eq. (4.138) gives 


cu m -n)e:]- rre 
p|————— (Х,= р | <x | = PLY < xJnp(1 — p) + np] ж P(x) 
| np(l — p) 2, 


= п 
ог P(Y < y)x (2—7) 
м np(l — p) 
Because we are approximating a discrete distribution by a continuous one, a slightly better арргох- 


imation is given by 


1 — 
PY < у) = 22-7) (4.139) 


vapil — p) 


Formula (4.139) is referred to as a continuity correction of Eq. (4.136). 
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4.68. 


4.69. 


4.70. 


4771. 
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Let Y be a Poisson r.v. with parameter 4. Using the central limit theorem, derive approximation 
formula: 


— А 
P(Y < у) (>=) (4.140) 
We saw in Prob. 4.54 that if X,, ..., X, are independent Poisson r.v.’s X; having parameter А,, then 
Y = X,+---+X, is also a Poisson r.v. with parameter A = A, + ++: + A,. Using this fact, we can view a 
Poisson r.v. Y with parameter A as a sum of independent Poisson r.v.'s X;, i = 1,..., n, each with parameter 


Afn; that is, 


The central limit theorem then implies that the r.v. Z defined by 
Y-E(Y) Y-A 
ЮМ Y-i 





2—2 (4.141) 
JNa(Y) Sa 
is approximately normal and 
P(Z < 2) =: D(z) (4.142) 
Substituting Eq. (4.141) into Eq. (4.142) gives 
Y-A 
of Ji «iens AZ + А) а Ф(2) 
—À 
or P(Y <y)® wi ) 
Again, using a continuity correction, a slightly better approximation is given by 
5—4 
PLY < у) ж (==) (4.143) 


Supplementary Problems 


Let Y = 2X + 3. Find the pdf of Y if X is a uniform r.v. over (— 1, 2). 
i l<y<7 
Ans. f) = f , 


0 otherwise 


Let X be a r.v. with pdf f(x). Let Y = | X |. Find the pdf of Y in terms of f(x). 


fo) -f-—» y»-0 


Ans. m= у<0 


Let Y = sin X, where X is uniformly distributed over (0, 2). Find the pdf of Y. 
1 


Ans. fly) = \л/1— у? 
0 otherwise 


-l<y<l 
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4.72. Let X and Y be independent r.v.’s, each uniformly distributed over (0, 1). Let Z= Х + Ү, И = Х – Ү, 
Find the marginal pdf's of Z and W. 


2 0 <2< 1 w+] -l<w<0 
Ans. f{z)=\—-z+2 1«z«2 fyw) = $—w-l O«w«l 
0 otherwise 0 otherwise 


4.73. Let X and Y be independent exponential r.v.’s with parameters а and f, respectively. Find the pdf of 
(a) Z = X — Y; (b) Z = X/Y; (с) Z = max(X, Y); (d) Z = min(X, Y). 








2B e z>0 aß >0 
—— z 
Ans. (a) f(z) = af (b faz) = \(az + В)? 
e 2<0 0 z «0 
а + В 
_ fae 1 —e 9 + е P (1 —e*) z>0 
(c) Р) = n z«0 
|j fe е Юе 2> 0 


4.74. Let X denote the number of heads obtained when three independent tossings of a fair coin are made. Let 
Y = Х2. Find E(Y). 


Ans. 3 


4.75. Let X be a uniform r.v. over (— 1, 1). Let Y = X”. 


(a) Calculate the covariance of X and Y. 
(b) Calculate the correlation coefficient of X and Y. 


J 3(2n + 1 
Ans. (a) Co«X,Y)-4n42 7794 (b) руу = eS n= odd 
0 п = even 0 п = еуеп 
4.76. Let the moment generating function of a discrete r.v. X be given by 
М g(t) = 0.25e' + 0.35e?' + 0.40е°! 
Find P(X = 3). 
Ans. 0.35 


4.77. Let X be a geometric r.v. with parameter p. 


(a) Determine the moment generating function of X. 
(b) Find the mean of X for p = å. 


f 


Ans. (a) м0 = ү t<~—Ingq=!—p (b) E(X) = 


telo 


4.78. Let X bea uniform r.v. over (a, b). 


(a) Determine the moment generating function of X. 
(b) Using the result of (a), find E(X), E(X?), and E(X?). 


tb е! 


Ans. (а) Ml) = a 


(b) E(X) = (b + a), E(X?) = 4b? + ab + a?) E(X?) = }(Ь? + b?a + ba? + а?) 
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4.79. 


4.80. 


481. 


4.82. 


4.83. 


4.84. 


4.85. 


4.86. 


4.87. 
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Consider a r.v. X with pdf 


1 
J/ 3n 


~x + 7)2/32 





Л) = —o «x«o 


e 


Find the moment generating function of X. 


Ans. My(t) = e777 8? 


Let X = N(0; 1). Using the moment generating function of X, determine F(X”). 


4 EX» = 0 n=1,3.5,... 
ns. ee en-N п= 2, 4, 6,... 


Let X and Y be independent binomial r.v.’s with parameters (n, p) and (m, р), respectively. Let Z = X + Y. 
What is the distribution of Z? 
Hint: Use the moment generating functions. 


Ans. Z isa binomial r.v. with parameters (n + m, р). 


Let (X, Y) be a continuous bivariate r.v. with joint pdf 


ein x>Oy>0 


foo у) = | 


0 otherwise 


(d) Find the joint moment function of X and Y. 


(b) Find the joint moments m,o, то, and т, |. 


1 
Апз. (а) Mti t) = турт) (b mo= 1. т = 1, m, = 1 


Let (X, Y) be a bivariate normal r.v. defined by Eq. (3.88). Find the joint moment generating function of X 
and Y. 


Ans. M gy, t3) = ete tom tide + 2гизохдур = 1220722 


Let X,,..., X, be n independent r.v.'s and X; > 0. Let 


Show that for large n, the pdf of Y is approximately log-normal. 

Hint: Take the natural logarithm of Y and use the central limit theorem and the result of Prob. 4.10. 

Let Y 2 (X — Ay / A. where X is a Poisson r.v. with parameter 4. Show that Y ~ N(0; 1) when 4 is suffi- 
ciently large. 

Hint: Find the moment generating function of Y and let 4 > oc. 

Consider an experiment of tossing a fair coin 1000 times. Find the probability of obtaining more that 520 
heads (a) by using formula (4.1 36), and (b) by formula (4.139). 

Ans. (a) 0.1038 (b) 0.0974 

The number of cars entering a parking lot is Poisson distributed with a rate of 100 cars per hour. Find the 


time required for more than 200 cars to have entered the parking lot with probability 0.90 (a) by using 
formula (4.140), and (b) by formula (4.143). 


Ans. (а) 2.189h (b) 2.1946 h 


Chapter 5 





Random Processes 


5.1 INTRODUCTION 


In this chapter, we introduce the concept of a random (or stochastic) process. The theory of 
random processes was first developed in connection with the study of fluctuations and noise in physi- 
cal systems. A random process is the mathematical model of an empirical process whose development 
is governed by probability laws. Random processes provides useful models for the studies of such 
diverse fields as statistical physics, communication and control, time series analysis, population 
growth, and management sciences. 


52 RANDOM PROCESSES 
1. Defintion: 


A random process is a family of r.v.’s {X(t), t € T! defined on a given probability space, indexed 
by the parameter t, where t varies over an index set Т. 

Recall that a random variable is a function defined on the sample space 5 (Sec. 2.2). Thus, a 
random process {X(t), t e T) is really a function of two arguments (X(t, 0), t e T, ¢ e 5). For a fixed 
(=), X(t,, ©) = Хб) is a r.v. denoted by X(t,), as ¢ varies over the sample space S. On the other 
hand, for a fixed sample point $, € S, X(t, ©) = X((t) is a single function of time t, called a sample 
function or a realization of the process. The totality of all sample functions is called an ensemble. 

Of course if both ¢ and t are fixed, X(t,, ©) is simply a real number. In the following we use the 
notation X(t) to represent X(t, ©). 


B. Description of a Random Process: 


In a random process (X(t), іє T}, the index set T is called the parameter set of the random 
process. The values assumed by X(t) are called states, and the set of all possible values forms the state 
space E of the random process. If the index set T of a random process is discrete, then the process is 
called a discrete-parameter (or discrete-time) process. A discrete-parameter process is also called a 
random sequence and is denoted by 1 X,, n = 1, 2, ...}. If T is continuous, then we have a continuous- 
parameter (or continuous-time) process. If the state space E of a random process is discrete, then the 
process is called a discrete-state process, often referred to as a chain. In this case, the state space E is 
often assumed to be (0, 1, 2, ...}. If the state space E is continuous, then we have a continuous-state 
process. 

A complex random process X(t) is defined by 


X(t) = Хү( + jX) 


where X ,(t) and X(t) are (real) random processes and j = /—1. Throughout this book, all random 
processes are real random processes unless specified otherwise. 


53 CHARACTERIZATION OF RANDOM PROCESSES 
A. Probabilistic Descriptions: 


Consider a random process X(t). For a fixed time t, X(t;) = X, isa r.v., and its cdf Fy(x,3 tj) is 
defined as 


Fy(x15 t) = PIX) € xi] (5.1) 
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Fy(x,; t4) is known as the first-order distribution of X(t). Similarly, given t, and t,, X(t,) = X, and 
X(t,) = X, represent two r.v.'s. Their joint distribution is known as the second-order distribution of 
X(t) and is given by 


Fy(x,, X25 ty, t2) = Р(Х) € xy, X(5) € x2} (5.2) 
In general, we define the nth-order distribution of X(t) by 
Fy(Xq, 065 Xps 0) = PUX € x, X(t) € x,] (5.3) 
If X(t) is a discrete-time process, then X(t) is specified by a collection of pmf's: 
DylX yy s Xm bo 5) = PIX) = х... X(t,) = x) (5.4) 
If X(t) is a continuous-time process, then X(t) is specified by a collection of pdf's: 
fax Xy tp s by) = CH ur cct "3 cub) (5.5) 


The complete characterization of X(t) requires knowledge of all the distributions as n > со. Fortu- 
nately, often much less is sufficient. 
B. Mean, Correlation, and Covariance Functions: 


As in the case of r.v.’s, random processes are often described by using statistical averages. 
The mean of X(t) 1s defined by 


ux(t) = ЕГХ(0)] (5.6) 


where X(t) is treated as a random variable for a fixed value of t. In general, ux(t) is a function of time, 
and it is often called the ensemble average of X(t). A measure of dependence among the r.v.'s of X(t) is 
provided by its autocorrelation function, defined by 


Куй, 5) = E[X(t)X(s)] (5.7) 

Note that 
Ry(t, s) = Ry(s, t) (5.8) 
and Ry(t, t) = ELX?^(0] (5.9) 


The autocovariance function of X(t) is defined by 
K x(t, 5) = Cov[X(0, X(s)] = E(LX() — ux(0]LX(s) — ux($1]] 
= Ry(t, 5) — ux(Dux(s) (5.10) 
It is clear that if the mean of X(t) is zero, then Kx(t, s) = Ry(t, s). Note that the variance of X(t) is 
given by 
cx*(t) = Var[X(t)] = E{LX() — ux(0]^] = Kx(t, t) (5.11) 


If X(t) is a complex random process, then its autocorrelation function R (t, s) and autocovariance 
function K y(t, s) are defined, respectively, by 


Ry(t, 5) = ELX(OX*(s)] (5.12) 
and Куй, s) = E{LX(t) — ux(]LX(S) — их(5)1*} (5.13) 
where * denotes the complex conjugate. 


54 CLASSIFICATION OF RANDOM PROCESSES 


If a random process X(t) possesses some special probabilistic structure, we can specify less to 
characterize X(t) completely. Some simple random processes are characterized completely by only the 
first- and second-order distributions. 
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A. Stationary Processes: 


A random process {X(t), t e T} is said to be stationary or strict-sense stationary if, for all n and 
for every set of time instants (t; € T, i = 1, 2, ..., n], 
Fy(x,, XS pss t) = Fy(xy x t o tuus t d t0) (5.14) 


for any т. Hence, the distribution of a stationary process will be unaffected by a shift in the time 
origin, and X(t) and X(t + т) will have the same distributions for апу т. Thus, for the first-order 
distribution, 


Fy(x; t) = F(x; t + т) = Ех) (5.15) 
and fx(x; t) = fx(x) (5.16) 
Then Ахй) = Е[Х(0)] = u (5.17) 
Var[X(t)] = о? (5.18) 

where д and о? are contants. Similarly, for the second-order distribution, 
Ех, X2; tj t2) = Ех, X2; t2 —1\) (5.19) 
and fais X2; bay t2) = ха, X2; t2 — 14) (5.20) 

Nonstationary processes are characterized by distributions depending on the points t4, t;, ... , t,. 


B. Wide-Sense Stationary Processes: 


If stationary condition (5.14) of a random process X(t) does not hold for all n but holds for n < k, 
then we say that the process X(t) is stationary to order k. If X(t) is stationary to order 2, then X(t) is 
said to be wide-sense stationary (WSS) or weak stationary. If X(t) is a WSS random process, then we 
have 


1. Е[Х(0)] = и (constant) (5.21) 
2. Ry(ts) = ЕГХ()Х(5)] = Ry(Is — tl) (5.22) 


Note that a strict-sense stationary process is also a WSS process, but, in general, the converse is not 
true. 


C. Independent Processes: 


In a random process X(t), if X(t;) for i = 1, 2, ..., n are independent r.v.'s, so that for n = 2, 3, ..., 


Fy(x,, ees Xy tp es ty) = П Fete: ti) (5.23) 
i-1 


then we call X(t) an independent random process. Thus, a first-order distribution is sufficient to charac- 
terize an independent random process X(t). 


D. Processes with Stationary Independent Increments: 


A random process (X(t), t > 0) is said to have independent increments if whenever 0 < t, < t, < 
eo «xL 


X(0), X(t1) — X(0), X(t2) — X(t1), ..., X(t) — X(t,-1) 
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are independent. If ( X(t), t > 0) has independent increments and X(t) — X(s) has the same distribu- 
tion as X(t + h) — X(s + h) for all s, t, h > 0, s < t, then the process X(t) is said to have stationary 
independent increments. 

Let (X(t), t > 0) be a random process with stationary independent increments and assume that 
X(0) 2 0. Then (Probs. 5.21 and 5.22) 


E(X(0] = ut (5.24) 

where д, = E( X(1)] and 
Var[X(0] = с, (5.25) 

where c,? = Var[ X(1)]. 
From Eq. (5.24), we see that processes with stationary independent increments are nonstationary. 


Examples of processes with stationary independent increments are Poisson processes and Wiener 
processes, which are discussed in later sections. 


E. Markov Processes: 
A random process {X(t), t € T} is said to be a Markov process if 
P(X(t,4 4) < Xn+1 | X(t,) = Xis X(t) = X2, ttt X(t,) = x, = P{X (ths 1) < Xn] | X(t,) = Xn} (5.26) 


whenever t, < t; € «t, < 1,1. 
A discrete-state Markov process is called a Markov chain. For a discrete-parameter Markov 
chain (X,, n > 0} (see Sec. 5.5), we have for every n 


PUX „+, =j| Xo Sio Xi, =i, ...,X, =) = P(X,,, =j|X, = (5.27) 


Equation (5.26) or Eq. (5.27) is referred to as the Markov property (which is also known as the 
memoryless property). This property of a Markov process states that the future state of the process 
depends only on the present state and not on the past history. Clearly, any process with independent 
increments is a Markov process. 

Using the Markov property, the nth-order distribution of a Markov process X(t) can be 
expressed as (Prob. 5.25) 


Еу (ху, Xy ip ss tl) = Ех 0) П PLX (ty) € xi)  X(t-)) = х-1} (5.28) 
k=2 


Thus, all finite-order distributions of a Markov process can be expressed in terms of the second-order 
distributions. 


F. Normal Processes: 


A random process (X(t), t € T} is said to be a normal (or gaussian) process if for any integer n 
and any subset {t,,...,¢,} of T, the n r.v/s X(t,),..., X(t,) are jointly normally distributed in the 
sense that their joint characteristic function is given by 


Ч sop xay (Q1 s, On) = Е{ехр Јо, Xt) ++, + 0, X(t,)]} 


. mn 1 n mn А 

= expl У, EIXU – 5 Y, Уто, Соух), хыл} (5.29) 
і= 1 і=1 к= 1 

where @,, ..., @, аге any real numbers (see Probs, 5.59 and 5.60) Equation (5.29) shows that a 

normal process is completely characterized by the second-order distributions. Thus, if a normal 

process is wide-sense stationary, then it is also strictly stationary. 
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G. Ergodic Processes: 


Consider a random process (X(t), —oo « t < 00) with a typical sample function x(t). The time 
average of x(t) 1s defined as 


1 Ti2 
€x(t)> = lim al x(t) dt (5.30) 


Toc -Tj2 


Similarly, the time autocorrelation function R,(t) of x(t) is defined as 


_ 1 ft? 

Ry(t) = (x(t)x(t + t)» = lim т | x(t)x(t + т) dt (5.31) 
To — T2 

A random process is said to be ergodic if it has the property that the time averages of sample 

functions of the process are equal to the corresponding statistical or ensemble averages. The subject 

of ergodicity is extremely complicated. However, in most physical applications, it is assumed that 

stationary processes are ergodic. 


55 DISCRETE-PARAMETER MARKOV CHAINS 


In this section we treat a discrete-parameter Markov chain (X,, n > 0) with a discrete state 
space E = (0, 1, 2, ...), where this set may be finite or infinite. If X, = i, then the Markov chain is 
said to be in state i at time n (or the nth step). A discrete-parameter Markov chain ( X,, n > 0} is 
characterized by [Eq. (5.27)] 


Р(Х, = 5 Xo = io, X; mi... Х, = = PUK =)|Х„ =й) (5.32) 


where P{x,4, =j|X, =i} are known as one-step transition probabilities. If P{x,4, = j| X, = i} is 
independent of n, then the Markov chain is said to possess stationary transition probabilities and the 
process is referred to as a homogeneous Markov chain. Otherwise the process is known as a nonhomo- 
geneous Markov chain. Note that the concepts of a Markov chain's having stationary transition 
probabilities and being a stationary random process should not be confused. The Markov process, in 
general, is not stationary. We shall consider only homogeneous Markov chains in this section. 


A. Transition Probability Matrix: 


Let (X,, n > 0) be a homogeneous Markov chain with a discrete infinite state space E = (0, 1, 
2, .... Then 


ру = P(X,., = МХ, = 0 iz0,j20 (5.33) 


regardless of the value of n. A transition probability matrix of {X,,n > 0] is defined by 


Poo Pos Ро? 
P= [р] [Ріо Pu Pra 
Pro P21 P22 
where the elements satisfy 
р; 20 Y Py = 1 і = 0, 1, 2, m (5.34) 
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In the case where the state space E is finite and equal to {1, 2,..., m}, P is m x m dimensional; that is, 
Pix Paz 7 Pim 
Р= р] = "t Fs Р" 
ml Poa UC Pram 
т 
where ру = 0 > pi = 1 i=1,2,...,m (5.35) 


A square matrix whose elements satisfy Eq. (5.34) or (5.35) is called a Markov matrix or stochastic 
matrix. 


B. Higher-Order Transition Probabilities—Chapman-Kolmogorov Equation: 


Tractability of Markov chain models is based on the fact that the probability distribution of 
{Х„,п = 0} can be computed by matrix manipulations. 

Let P = [р,;) be the transition probability matrix of a Markov chain {X,, n > 0}. Matrix powers 
of P are defined by 


Р? = РР 
with the (i, jth element given by 


Note that when the state space E is infinite, the series above converges, since by Eq. (5.34), 
Y рари < У Ри = 1 
k k 
Similarly, P? = PP? has the (i, j)th element 
pP) = у Dix Pej 


and in general, Р"*! = PP" has the (i, j)th element 
ру“? = Y pa Pej” (5.36) 
k 


Finally, we define P? — I, where I is the identity matrix. 
The n-step transition probabilities for the homogeneous Markov chain (X,, n > 0} are defined 
by 


P(X,—j|Xo— i) 
Then we can show that (Prob. 5.70) 
ру” = Р(Х„=]|Х = i) (5.37) 


We compute р,” by taking matrix powers. 
The matrix identity 


pr*m. р"р" n,m>0 
when written in terms of elements 


pat = y pap, (5.38) 
k 
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is known as the Chapman-Kolmogorov equation. It expresses the fact that a transition from i to j in 
n + т steps can be achieved by moving from i to an intermediate k in n steps (with probability р), 
and then proceeding to j from k in m steps (with probability p,,’”). Furthermore, the events “go from i 
to k in n steps" and "go from k to j in m steps" are independent. Hence the probability of the 
transition from i to j in n + m steps via i, k, j is рр". Finally, the probability of the transition 
from i to j is obtained by summing over the intermediate state k. 


C. The Probability Distribution of ( X,, n > 0): 


Let p(n) = P(X, = i) and 
p(n) = [ро(и) рү(п) pom ··-] 


where Y p,(n) = 1 
к 


Then р(0) = Р(Х, = i) are the initial-state probabilities, 


p(0) = [po(0) р,(0) p0) ·*:] 
is called the initial-state probability vector, and p(n) is called the state probability vector after n tran- 
sitions or the probability distribution of X,,. Now it can be shown that (Prob. 5.29) 
p(n) = p(0)P" (5.39) 


which indicates that the probability distribution of a homogeneous Markov chain is completely 
determined by the one-step transition probability matrix P and the initial-state probability vector 


p(0). 


D. Classification of States: 


1. 


ю 


Accessible States: 


State j is said to be accessible from state i if for some n > 0, р," > 0, and we write i >j. Two 
states i and j accessible to each other are said to communicate, and we write i<j. If all states commu- 
nicate with each other, then we say that the Markov chain is irreducible. 


Recurrent States: 
Let T; be the time (or the number of steps) of the first visit to state j after time zero, unless state j 
is never visited, in which case we set T, = oo. Then T; is a discrete r.v. taking values in (1, 2, ..., oo}. 
Let 

f" = Р(Т = т| Хо= й) = Р(Х, =), X %), к= 1, 2,..., т Хо = 0 (5.40) 

and fj” = 0 since Т, > 1. Then 
Sf? = Р(Т =1| Xo =) = P(X, = Хо = ) = py (5.41) 
and Fi = Уу pa fi? m-23.. (5.42) 

k*j 


The probability of visiting j in finite time, starting from i, is given by 


= Y Ду” = PUT, < ©|Хе =) (5.43) 


n=0 
Now state j is said to be recurrent if 
Ју = Р(Т < =©|Х =) = 1 (5.44) 
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That is, starting from j, the probability of eventual return to j is one. A recurrent state j is said to be 
positive recurrent if 


E(T;| Xo =f) < oo (5.45) 
and state j is said to be null recurrent if 
E(T,| Xo — j) = © (5.46) 
Note that 
E(T,| Xo =)= Y nf,” (5.47) 
n=0 


3. Transient States: 
State j is said to be transient (or nonrecurrent) if 
fp = РТ < с|Х=]))<1 (5.48) 
In this case there is positive probability of never returning to state j. 
4. Periodic and Aperiodic States: 
We define the period of state j to be 
d(j) = ged(n > 1: р," > 0j 


where gcd stands for greatest common divisor. 
If d(j) > 1, then state j is called periodic with period d( j). If d(j) = 1, then state j is called aperiodic. 
Note that whenever р, > 0, j is aperiodic. 


5. Absorbing States: 


State j is said to be an absorbing state if pj; = 1; that is, once state j is reached, it is never left. 


E. Absorption Probabilities: 


Consider a Markov chain X(n) = (X,, n > 0} with finite state space E = (1, 2, ..., N} and tran- 
sition probability matrix P. Let A = (1,..., m] be the set of absorbing states and В = {m + 1, ..., N} 
be a set of nonabsorbing states. Then the transition probability matrix P can be expressed as 


I 0 0 0 0 
0 1 0 0 0 
: Doa : ul : го 
Р = 0 etr ] 0 e 0 -|; A (5.49a) 
Pm+1.1 ' Utt Pm+i,m Pm+i,m+1 7 Pm+i,N 
рм. \ o7 рит PN, m+1 Uc рих 
where 1 is an m x m identity matrix, О is an m x (№ — m) zero matrix, and 
Pm+t, 1 17 Dm i.m Рта+ї,т+1 ttt Pm+i,N 
R= UO ' Q= ' ul : (5.49b) 
DN. 1 US рыт Рм, т+ Uc Рм, м 


Note that the elements of К are the one-step transition probabilities from nonabsorbing to absorbing 
states, and the elements of Q are the one-step transition probabilities among the nonabsorbing states. 
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Let U = [u,;], where 
щу = P{X, = је A)| Xo = K(e B)} 


It is seen that U is an (N — m) x m matrix and its elements are the absorption probabilities for the 
various absorbing states. Then it can be shown that (Prob. 5.40) 


U=(1—Q)'R=OR (5.50) 
The matrix Ф = (J — 0)! is known as the fundamental matrix of the Markov chain X(n). Let Т, 
denote the total time units (or steps) to absorption from state k. Let 

T= [T+ Тә 777 Ty] 
Then it can be shown that (Prob. 5.74) 
N 
ET) = У dei k=m+1,...,N (5.51) 
{=т+1 


where фу is the (k, i)th element of the fundamental matrix Ф. 


F. Stationary Distributions: 


Let P be the transition probability matrix of a homogeneous Markov chain {X,, n > 0). If there 
exists a probability vector p such that 


pP=)p (5.52) 
then р is called a stationary distribution for the Markov chain. Equation (5.52) indicates that a sta- 
tionary distribution р is a (left) eigenvector of P with eigenvalue 1. Note that any nonzero multiple of p 


is also an eigenvector of Р. But the stationary distribution ф is fixed by being a probability vector; 
that is, its components sum to unity. 


G. Limiting Distributions: 


A Markov chain is called regular if there is a finite positive integer m such that after m time-steps, 
every state has a nonzero chance of being occupied, no matter what the initial state. Let A > О 
denote that every element a;; of A satisfies the condition aj; > 0. Then, for a regular Markov chain 
with transition probability matrix P, there exists an m > 0 such that P" > О. For a regular homoge- 
neous Markov chain we have the following theorem: 


THEOREM 5.5.1 


Let {X,, n > 0) be a regular homogeneous finite-state Markov chain with transition matrix Р. 
Then 


lim Р" = P (5.53) 


n> о 


where Ê is a matrix whose rows are identical and equal to the stationary distribution p for the 
Markov chain defined by Eq. (5.52). 


5.6 POISSON PROCESSES 
A. Definitions: 


Let t represent a time variable. Suppose an experiment begins at t = 0. Events of a particular 
kind occur randomly, the first at T;, the second at Т,, and so on. The r.v. T; denotes the time at which 
the ith event occurs, and the values t; of T; (i = 1, 2,...) are called points of occurrence (Fig. 5-1). 
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Let Z-T-T,., (5.54) 


and T, = 0. Then Z, denotes the time between the (n — l)st and the nth events (Fig. 5-1). The 
sequence of ordered r.v.'s (Z,, n = 1} is sometimes called an interarrival process. If all r.v.'s Z, are 
independent and identically distributed, then (Z,, n > 1} is called a renewal process or a recurrent 
process. From Eq. (5.54), we see that 


T, = 2Z2,+2Z,+°°°+Z, 


where T, denotes the time from the beginning until the occurrence of the nth event. Thus, {T,, n = 0} 
is sometimes called an arrival process. 


B. Counting Processes: 


A random process (X(t), t > 0} is said to be a counting process if X(t) represents the total number 
of "events" that have occurred in the interval (0, t). From its definition, we see that for a counting 
process, X(t) must satisfy the following conditions: 


1. Хе) > 0 and X(0) = 0. 

2. X(t) is integer valued. 

3. X(s)< X(t) ifs <t. 

4. X(t) – X(s) equals the number of events that have occurred on the interval (s, t). 


A typical sample function (or realization) of X(t) is shown in Fig. 5-2. 

A counting process X(t) is said to possess independent increments if the numbers of events which 
occur in disjoint time intervals are independent. A counting process X(t) is said to possess stationary 
increments if the number of events in the interval (s + h, t + h)—that is, X(t + h) — X(s + h)—has the 
same distribution as the number of events in the interval (s, t)—that is, X(t) — X(s)—for all s « t and 
h> 0. 





Fig. 5-2 A sample function of a counting process. 


C. Poisson Processes: 


One of the most important types of counting processes is the Poisson process (or Poisson counting 
process), which is defined as follows: 
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DEFINITION 5.6.1 
A counting process X(t) is said to be a Poisson process with rate (or intensity) A( > 0) if 
1. Х(0) = 0. 
2. X(t) has independent increments. 


3. The number of events in any interval of length t is Poisson distributed with mean 4t; that is, for 
all s, t > 0, 


P[X(t + 5) — X(s) = n] =e Um n=0,1,2,... (5.55) 


It follows from condition 3 of Def. 5.6.1 that a Poisson process has stationary increments and that 
EL X(t)] = At (5.56) 
Then by Eq. (2.43) (Sec. 2.7C), we have 
Var[X(t)] = At (5.57) 


Thus, the expected number of events in the unit interval (0, 1), or any other interval of unit length, is 
just 4 (hence the name of the rate or intensity). 
An alternative definition of a Poisson process is given as follows: 


DEFINITION 5.6.2 
A counting process X(t) is said to be a Poisson process with rate (or intensity) A(> 0) if 
1. X(0 20. 
2. X(t)has independent and stationary increments. 
3. P[X(t + At) - X(t) = 1] =A At + o(At) 
4. PUX(t + At) — X(t) = 2] = o(At) 
where o(At) is a function of At which goes to zero faster than does At; that is, 


А) 
lim —— — 0 5.58 
ao Af 938) 


Note: Since addition or multiplication by a scalar does not change the property of approaching zero, 
even when divided by At, o(At) satisfies useful identities such as o(At) + o(At) = o(At) and 
ao(At) = o(At) for all constant a. 


It can be shown that Def. 5.6.1 and Def. 5.6.2 are equivalent (Prob. 5.49). Note that from condi- 
tions 3 and 4 of Def. 5.6.2, we have (Prob. 5.50) 


P[X(t + At) — X(t) = 0] = 1 — å At + o(At) (5.59) 


Equation (5.59) states that the probability that no event occurs in any short interval approaches unity 
as the duration of the interval approaches zero. It can be shown that in the Poisson process, the 
intervals between successive events are independent and identically distributed exponential r.v.'s 
(Prob. 5.53). Thus, we also identify the Poisson process as a renewal process with exponentially 
distributed intervals. 

The autocorrelation function Ry(t, s) and the autocovariance function K,(t, 5) of a Poisson 
process X(t) with rate A are given by (Prob. 5.52) 


Ry(t, s) = А min(t, s) + 4215 (5.60) 
K,(t, s) = А min(t, s) (5.61) 
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57 WIENER PROCESSES 
Another example of random processes with independent stationary increments is a Wiener process. 


DEFINITION 5.7.1 


A random process { X(t), t = 0} is called a Wiener process if 


1. X(t) has stationary independent increments. 

2. The increment X(t} — X(s) (t > s) is normally distributed. 
3. E[X(t)] =0. 

4. Х(0) = 0. 


The Wiener process is also known as the Brownian motion process, since it originates as a model for 
Brownian motion, the motion of particles suspended in a fluid. From Def. 5.7.1, we can verify that a 
Wiener process is a normal process (Prob. 5.61) and 


E[X(t)] = 0 (5.62) 
Уаг[Х(0)] = o?t (5.63) 


where o? is a parameter of the Wiener process which must be determined from observations. When 
c? = 1, X(t) is called a standard Wiener (or standard Brownian motion) process. 

The autocorrelation function Ry(t, s) and the autocovariance function K,(t, 5) of a Wiener 
process X(t) are given by (see Prob. 5.23) 


Ry(t, 5) = Ky(t, s) = o? min(t, s) s,t>0 (5.64) 
DEFINITION 5.7.2 
A random process {X(t), t = 0} is called a Wiener process with drift coefficient p if 


1. X(t) has stationary independent increments. 
2. X(t)is normally distributed with mean yt. 
3. X(0 20. 


From condition 2, the pdf of a standard Wiener process with drift coefficient џи is given by 


1 
Је) = m есш» (5.65) 
J2n 


Solved Problems 


RANDOM PROCESSES 


5.1. Let X,, X5, ... be independent Bernoulli r.v.’s (Sec. 2.7A) with P(X, = 1) = p and P(X, = 0) = 
q — 1 — p for all n. The collection of r.v/s (X,, n > 1} is a random process, and it is called a 
Bernoulli process. 

(a) Describe the Bernoulli process. 
(b) Construct a typical sample sequence of the Bernoulli process. 


(а) The Bernoulli process (X,, n > 1} is a discrete-parameter, discrete-state process. The state space is 
Е = (0, 1}, and the index set is T = (1, 2,...}. 
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5.2. 


5.3. 


(b) A sample sequence of the Bernoulli process can be obtained by tossing a coin consecutively. If a head 
appears, we assign 1, and if a tail appears, we assign 0. Thus, for instance, 


n 1 2 3 4 5 6 7 8 9 10 
Coin tossing H T T H H H T H H T 
x 1 0 0 1 1 1 0 1 1 0 


л 


The sample sequence {x,} obtained above is plotted in Fig. 5-3. 





0 2 4 6 8 10 п 
Fig. 5-3 А sample function of a Bernoulli process. 


Let Z,, 2,, ... be independent identically distributed гуз with P(Z,=1)=p and 
P(Z, = –1) = 4= 1 — p for all n. Let 


X,-YZ, n=1,2,... (5.66) 


i=1 
and X, = 0. The collection of rv.’s (X,, n = 0] is a random process, and it is called the simple 
random walk X (n) in one dimension. 
(a) Describe the simple random walk X(n). 
(b) Construct a typical sample sequence (or realization) of X(n). 
(a) The simple random walk X(n) is a discrete-parameter (or time), discrete-state random process. The 
state space is E = (..., —2, —1, 0, 1, 2,...}, and the index parameter set is T = (0, 1,2, ...). 


(b) А sample sequence x(n) of a simple random walk X(n) can be produced by tossing а coin every second 
and letting x(n) increase by unity if a head appears and decrease by unity if a tail appears. Thus, for 


instance, 
n 0 1 2 3 4 5 6 7 8 9 10 
Coin tossing H T T H H H T H H T 
x(n) 0 1 0 —1 0 1 2 1 2 3 2 


The sample sequence x(n) obtained above is plotted in Fig. 5-4. The simple random walk X(n) specified 
in this problem is said to be unrestricted because there are no bounds on the possible values of X, . 


The simple random walk process is often used in the following primitive gambling model: 
Toss a coin. If a head appears, you win one dollar; if a tail appears, you lose one dollar (see 
Prob. 5.38). 


Let (X,, n > 0] be a simple random walk of Prob. 5.2. Now let the random process X(t) be 
defined by 


X(t) = X, n<t<n+l 
(a) Describe X(t). 
(b) Construct a typical sample function of X(t). 


(a) The random process X(t) is a continuous-parameter (or time), discrete-state random process. The state 
space is E = (..., —2, — 1,0, 1, 2,...}, and the index parameter set is T = (t, t > 0j. 


(b) A sample function x(t) of X(t) corresponding to Fig. 5-4 is shown in Fig. 5-5. 


174 


5.4. 


5.5. 
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Fig. 5-4 A sample function of a random walk. 





Fig. 5-5 


Consider a random process X(t) defined by 
X(t) 2 Y cos wt t>0 
where « is a constant and Y is a uniform r.v. over (0, 1). 
(a) Describe X(t). 
(b) Sketch a few typical sample functions of X(t). 


(a) The random process X(t) is а continuous-parameter (or time), continuous-state random process. The 
state space is E = (x: —1 < x < 1} and the index parameter set is T = (t: t > 0). 


(b) Three sample functions of X(t) are sketched in Fig. 5-6. 


Consider patients coming to a doctor's office at random points in time. Let X, denote the time 
(in hours) that the nth patient has to wait in the office before being admitted to see the doctor. 

(a) Describe the random process X(n) = (X,, n > 1). 

(b) Construct a typical sample function of X(n). 


(a) The random process X(n) is a discrete-parameter, continuous-state random process. The state space is 
Е = (x: x > 0), and the index parameter set is Т = (1,2, ...). 


(b) Asample function x(n) of X(n) is shown in Fig. 5-7. 


CHARACTERIZATION OF RANDOM PROCESSES 


5.6. 


Consider the Bernoulli process of Prob. 5.1. Determine the probability of occurrence of the 
sample sequence obtained in part (b) of Prob. 5.1. 
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x0 


x40 





x40 
Y=0 
0 
t 
Fig. 5-6 
Since X,'s are independent, we have 
P(X, = x, X2 = xj, Xn = x) = Р(Х, = х,)Р(Х, = x3) с Р(Х, = x) (5.67) 


Thus, for the sample sequence of Fig. 5-3, 


Р(Х, = 1, X,=0, Хз = 0, Х, = 1, Х, = 1, Х„ = 1, X9 0, X4 = 1, X = 1, Xio = 0) = рб* 


x(n) 


t2 
> 
a 
© 
9 
c 
= 
ә 


176 


5.7. 


5.8. 


5.9. 
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Consider the random process X(t) of Prob. 5.4. Determine the pdfs of X(t) at t = 0, л/4о, л/20, 
л/о@. 

For t = 0, X(0) = Y cos 0 = Y. Thus, 
1 О<х< 1 


Го) = | 


0 otherwise 
For t = n/Ao, Х(п/40) = Y cos n/4 = t/,/2 Y. Thus, 


/2 90 1/2 
Јака) = [у TX An 


0 otherwise 


For ( = л/20, X(n/2w) = Y cos n/2 = 0; that is, Х(л/20) = 0 irrespective of the value of Y. Thus, the 
pmf of X(w/2@) is 


Pxinizo X) = Р(Х =0)=1 
For t = п/о, X(n/w) = Y cos n = — Y. Thus, 


l —1<х<0 


Aoc X) = n 


otherwise 


Derive the first-order probability distribution of the simple random walk X(n) of Prob. 5.2. 


The first-order probability distribution of the simple random walk X(n) is given by 
pK) = Р(Х, = К) 


where k is an integer. Note that P(X, = 0) = 1. We note that p,(k) = 0 ifn < |k] because the simple random 
walk cannot get to level k in less than |k | steps. Thus, n > ||. 

Let N,* and N,” be the r.v.'s denoting the numbers of +1s and — 15, respectively, in the first n steps. 
Then 


n=N,* +N, (5.68) 
Х,= №, = № (5.69) 

Adding Eqs. (5.68) апа (5.69), we get 
N,' =4(n+ X,) (5.70) 


Thus, X, = k if and only if N,* = (п + К). From Eq. (5.70), we note that 2N,* = п + X, must be even. 
Thus, X, must be even if n is even, and X, must be odd if n is odd. We note that N,* is a binomial r.v. with 
parameters (n, p). Thus, by Eq. (2.36), we obtain 


alk) = (a Fon? ию юз qn pp (5.71) 


where n > ||, and n and К are either both even or both odd. 


Consider the simple random walk X(n) of Prob. 5.2. 


(a) Find the probability that X(n) = —2 after four steps. 


(b) Verify the result of part (a) by enumerating all possible sample sequences that lead to the 
value X(n) = —2 after four steps. 


(a) Setting k = —2and n = 4 in Eq. (5.71), we obtain 





, 4V 4 з 
P(X, = —2) = py(—2) = (jPa = 4pq q-l-p 
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х(л) 





Fig. 5-8 
(b) All possible sample functions that lead to the value X, = —2 after 4 steps are shown in Fig. 5-8. For 
each sample sequence, P(X, = —2) = pq?. There are only four sample functions that lead to the value 


X, = --2 after four steps. Thus P(X, = —2) = 4рд?. 


5.10 Find the mean and variance of the simple random walk X(n) of Prob. 5.2. 
From Eq. (5.66), we have 
X= Xn- + 2, n=1,2,... (5.72) 
and X, = 0 and 2, (n = 1, 2,...) are independent and identically distributed (iid) r.v.'s with 
Р(2,= +1) = р Р(2,= 0) = 4 = 1-р 
From Eq. (5.72), we observe that 


Хх = Xo t Z =Z, 
X,=X%,+2,=2,4+ 2, 


(5.73) 
X,=2Z,+2Z,+::°°:4+2Z, 
Then, because the Z, are iid r.v.’s and Xo = 0, by Eqs. (4.108) and (4.112), we have 
E(X,) = «( У л) = nE(Z,) 
k=1 
Var(X,) = var( У А) = п Var(Z,) 
к= 1 
Now E(Z,) = ()р+(—1)д=р—4 (5.74) 
Е(2,?) = (?p +(-I?g=pt+q=l (5.75) 
Thus Var(Z,) = E(Z,”) — [E(Z,)]? = | — (p — a4) = 4pq (5.76) 
Hence, E(X,) = n(p — q) 4= 1-р (5.77) 
Var(X,,) = 4npq 4= 1-р (5.78) 
Note that if p = q = 3, then 
E(X,) =0 (5.79) 
Var(X,) — n (5.80) 


511. Find the autocorrelation function Ry(n, m) of the simple random walk X (n) of Prob. 5.2. 
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5.12. 


5.13. 
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From Eq. (5.73), we can express X, as 


where Zo = X, = O and 2, (i = 1) are tid r.v.’s with 
Р(2,= +1) = р P(Z;= -1) == 1-р 
By Eq. (5.7), 
Ry(n, т) = E[X(nX(m)] = E(X „X m) 
Then by Eq. (5.81), 


min(n, m) 


Ry(n, m) = Y Y E(Z, Z,) = Y Е(2;2) + у, Y E(Z)E(Z,) 
i=0 k=0 і=0 EINE 


ith 
Using Eqs. (5.74) and (5.75), we obtain 
Ry(n, m) = min(n, m) + [nm — min(n, m)](p — 4)? 


т + (пт – тр – 92 m<n 


Ry(n, т) = 
or xn, m) | + (nm — nXp — а)? п<т 
Note that if p = = 3, then 


Ry(n, т) = min(n, m) nm»0 


Consider the random process X(t) of Prob. 5.4; that is, 
X(t) 2 Y cos ot t>0 
where w is a constant and Y is a uniform r.v. over (0, 1). 
(a) Find E[X(t)]. 
(b) Find the autocorrelation function Ry(t, s) of X(t). 
(c) Find the autocovariance function K x(t, s) of X(t). 
(a) From Eqs. (2.46) and (2.91), we have E(Y) = 4 and E(Y?) = 4. Thus 
Е{Х(ї)] = E(Y cos wt) = E(Y) cos wt = $ cos wt 
(b) By Eq. (5.7), we have 
Ry(t, 5) = EL X(t)X(s)] = E(Y? cos wt cos ws) 
= E(Y?) cos wt cos ws = i COS Ot COS 05 
(c) By Eq. (5.10), we have 


Ky(t, 5) = Ry(t, s) — ELX()]EEX(S)] 
= 4 cos wt cos os — $ cos wt cos ws 
= у COS ot cos ws 


(CHAP 5 


(5.81) 


(5.82) 


(5.83) 


(5.84) 


(5.85) 


(5.86) 


(5.87) 


(5.88) 


Consider a discrete-parameter random process X(n) = (X,, n 2 1} where the X,’s are iid r.v.'s 


with common cdf F (x), mean и, and variance о?. 

(a) Find the joint cdf of X(n). 

(b) Find the mean of X(n). 

(c) Find the autocorrelation function R (n, m) of X(n). 
(d) Find the autocovariance function K x(n, m) of X(n). 
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(a) Since the X,’s are iid r.v.’s with common cdf F,(x), the joint cdf of X(n) is given by 
Ех Xp) = Пех) = [Fy(x)]" (5.89) 
i=l 


(b) The mean of X(n) is 
ux(n) = E(X) = и for all n (5.90) 
(c) Yf n + m, by Eqs. (5.7) and (5.90), 
Күп, m) = E(X, X „) = E(X E(X ,) = и? 
If n = m, then by Eq. (2.31), 
E(X,^) = Var(X,) + [E(X JJ = о? + и? 


2 


A něm 
Hence, Ry(n, т) = P" a n-m (5.91) 
(d) By Eq. (5.10), 
0 £ 
K x(n, m) = Ry(n, m) — ux(n)ux(m) = p, " - m (5.92) 


CLASSIFICATION OF RANDOM PROCESSES 


5.14. Show that a random process which is stationary to order n is also stationary to all orders lower 
than n. 


Assume that Eq. (5.14) holds for some particular n; that is, 
P(X(t) € xy, ..., X(t,) < х,) = P{X(t, +S xy... X(t, +S x} 
for any т. Letting x, — oo, we have [see Eq. (3.63)] 
PlX(t) € xy, ..., X(n) £x, ,) = PIX(, + DS ху,..., (р +T) S х„..,) 


and the process is stationary to order n — 1. Continuing the same procedure, we see that the process is 
stationary to all orders lower than n. 


5.15. Show that if {X(t}, t e T] is a strict-sense stationary random process, then it is also WSS. 


Since X(t) is strict-sense stationary, the first- and second-order distributions are invariant through time 
translation for all т e T. Then we have 


ші) = E[X()] = ELX(t + 1)] = ux(t + т) 
and hence the mean function u4(t) must be constant; that is, 
E[X(t)] = и (constant) 
Similarly, we have 
EL X(s)X(t)] = ELX(s + 1) X(t + 2)] 


so that the autocorrelation function would depend on the time points s and t only through the difference 
|t — s|. Thus, X(t) is WSS. 


5.16. Let (X,, n > 0} be a sequence of iid r.v.’s with mean 0 and variance 1. Show that {X,, n > 0} is 
a WSS process. 


By Eq. (5.90), 
Е(Х,) = 0 (constant) for all n 
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and by Eq. (5.91), 
EIX JE(X 454) 20 k #0 


Ri +b) = АХ, XS) = fry 2) = Var(X) = | k=0 


which depends only on k. Thus, {X,} is a WSS process. 


5.17. Show that if a random process X(t) is WSS, then it must also be covariance stationary. 


If X(t) is WSS, then 
E[X(0] = џи (constant) for all t 


Ry(t, t + t)] = Ry(x) for all t 
Now Ky(t, + т) = Cov[X()X(t + 0] = Ry(t t + 0 — E[X()]EDX(G + 0] 
= Ry(1) – p? 


which indicates that K y(t, t + т) depends only on т; thus, X(t) is covariance stationary. 


5.18. Consider a random process X(t) defined by 
X(t) = О cos wt + V sin ot —20 <1< 00 (5.93) 
where c is constant апа U and V аге r.v.’s. 
(a) Show that the condition 
E(U) = Е(У) = 0 (5.94) 
is necessary for X(t) to be stationary. 
(b) Show that X(t) is WSS if and only if U and V are uncorrelated with equal variance; that is, 
E(UV) = 0 E(U?) = E(V?) = о? (5.95) 
(a) Now 
р) = E[X(0] = E(U) cos wt + E(V) sin wt 


must be independent of t for X(t} to be stationary. This is possible only if y,(t)=0, that is, 
E(U) = E(V) = 0. 
(b) 1f X(t) is WSS, then 


ЕГХ?0) = є| (2) = Ryx(0) = су? 
But X(0) = U and X(z/20) = V ; thus 
E(U?) = E(V2) = сұ? = о? 
Using the above result, we obtain 


Rot, + x) = E[X(OX(t + т)] 
= E{{U cos wt + V sin wt)[U cos w(t + x) + V sin w(t + т)]} 
= ø? cos wt + E(UV) sin(2wt + wr) (5.96) 


which will be a function of t only if E(UV) = 0. Conversely, if E(UV) = 0 and E(U?) = E(V?) = 0°, 
then from the result of part (а) and Eq. (5.96), we have 


пуй) = 0 
Ry(t, t т) = о? cos wt = Кү(т) 


Hence, X(t) is WSS. 
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5.19. 


5.20. 


5.21. 


Consider a random process X(t) defined by 
X(t) = Ucost+V sint —o <1< 00 


where О апа V are independent r.v.’s, each of which assumes the values —2 and 1 with the 
probabilities 4 and 4, respectively. Show that X(t) is WSS but not strict-sense stationary. 


We have 


E(U) = E(V) = 4—2) + $(1) = 0 
E(U?) = EV?) = $( 2n + 301) = 2 


Since U and V are independent, 
E(UV) = E(U)E(V) = 0 
Thus, by the results of Prob. 5.18, X(t) is WSS. To see if X(t) is strict-sense stationary, we consider E[X *(t)]. 


E[X3(t)] = E[(U cos t + V sin t)?] 
= E(U?) cos? t + 3E(U?V) cos? t sin t + 3E(UV?) cos t sin? t + E(V?) sin? t 


Now E(U?) = E(V*) = $(—2)3 + 41)? = —2 
E(U?V) = Е(О?)Е(И) = 0 E(UV?) = E(U)E(V?) 2 0 
Thus ELX] = — 2(cos? t + sin? t) 


which is a function of t. From Eq. (5.16), we see that all the moments of a strict-sense stationary process 
must be independent of time. Thus X(t) is not strict-sense stationary. 


Consider a random process X(t) defined by 
X(t) = A cos(@t + Ө) —oo«t« oo 
where A and o are constants and © is a uniform r.v. over (— n, л). Show that X(t) 15 WSS. 
From Eq. (2.44), we have 


—n<@<n 
fe(8) = \2n 
0 otherwise 
A л 
Тһеп ux(t) = on | cos(wt + 8) 40 = 0 (5.97) 


Setting s = t + t in Eq. (5.7), we have 


A? л 

Ryx(t, t+ t) = on | cos(@t + 8) cos[w(t + т) + 8)] dé 
A? f" 1 

-—— 5 [cos от + cos(2wt + 20 + от)] 40 


=r 


= — COS WT (5.98) 


Since the mean of X(t) is a constant and the autocorrelation of X(t) is a function of time difference only, we 
conclude that X(t) is WSS. 


Let {X(t), t = 0) be a random process with stationary independent increments, and assume that 
X(0) = 0. Show that 


E[X()] = uit (5.99) 
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where u, = E[X(1)]. 


Let ГО) = E[X()] = ELX() — Х(0)] 


Then, for any t and s and using Eq. (4.108) and the property of the stationary independent increments, we 
have 


fit+ s) = E[X(t + з) — X(0)] 
= E[X(t + s) — X(s) + X(s) — X(0)] 
= E[X(t + s) — X(s)] + E[X(s) — X(0)] 
= E[X(t) — X(0)] + E[X(s) — X(0)] 
= f(t) + f(s) (5.100) 


The only solution to the above functional equation is f(t} = ct, where c is a constant. Since с = f(1) = 
E[ X(1)], we obtain 


ELX@] =a m = ЕГХ()] 


Let (X(t), t > 0) be a random process with stationary independent increments, and assume that 
X(0) = 0. Show that 


(a) Var[X(t)] = o,?t (5.101) 
(b) Var[ X(t) — X(s)] = о t —3 — t»s (5.102) 
where o,? = Var[X(1)]. 


(a) Let g(t) = Var X(t)] = Var X(t) — X(0)] 
Then, for any t and s and using Eq. (4.112) and the property of the stationary independent increments, 
we get 
g(t + s) = Var[X(t + s) — X(0)] 
= Var[X(t + s) — X(s) + X(s) — X(0)] 
= Var(X(t + s) — X(s)] + Var(X(s) — X(0)] 
= Var(X(t) — X(0)] + Var[X(s) — X(0)] 
= g(t) + g(s) 


which is the same functional equation as Eq. (5.100). Thus, g(t) = kt, where k is a constant. Since 
k = g(1) = Var[X(1)], we obtain 


Var[X(t)] = 0,78 а? = Var[X(1)] 
(b) Lett s. Then 


Var[X(t] = Var[X(t) — X(s) + X(s) — X(0)] 
= Var[ X(t) — X(s)] + Var[X(s) — X(0)] 
= Var[X(t) — X(s)] + Var[X(s)] 


Thus, using Eq. (5.101), we obtain 
Var[X(t) — X(s)] = Var[X(r)] — Var[X(s)] = o,?(t — s) 


Let (X(t), t > 0) be a random process with stationary independent increments, and assume that 
X(0) = 0. Show that 


Cov[ X(t), X(s)] = Ky(t, s) = o,? min(t, s) (5.103) 
where с, = Var[X(1)]. 
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By definition (2.28), 


Var[ X(t) — X(s)] = E({X(t) ~ X(s) - ELX(t) — X(])?) 

= E[(X() — E[X(t)]} — {X(s) — ELX(G]?] 
= E({X(t) — E[X()])? — 2(X(0 — ELX()]H(X(s) — ELX(]) + {X(s) — ELX(s)]}?) 

= Var[X(t)] — 2 Cov[ X(t), X(s)] + Уаг[ Х(5)] 

Thus, Cov[X(t), X(s)] = 4{Var[X()] + Уаг[ Х(5)] — Var[ X(t) — X(s)]} 

Using Eqs. (5.101) and (5.102), we obtain 
fte s-(t-3]- 07s >з 
К, DU =o,*t s>t 


or K y(t, s) = 0,7 min(t, s) 


where с? = Var[X(1)]. 


5.24. (a) Show that a simple random walk X(n) of Prob. 5.2 is a Markov chain. 
(b) Find its one-step transition probabilities. 
(a) From Eq. (5.73) (Prob. 5.10), X(n) = (X,, n = 0} can be expressed as 


where Z, (n = 1, 2, ...) are iid r.v.'s with 
P(Z,-k)-a, (К = 1, —1) апа ар = р а ј=4= 1-р 
Then X(n) = (X,, n z 0} is a Markov chain, since 


Р(Х, = | Xo 50, Xi =, ..., Х, =) 
= Р(2,+1 +i, = 1,11 Хо = 0, X, = i,e Xn = ip) 
= Р(2,+1= insi 1,) = а; „= Р(Х, = ing | Xn =) 


dn 17 і 


since Z,,, is independent of Xo, X,,..., Xn 
(b) The one-step transition probabilities are given by 


p k=j+1 
рк = P(X,=k)X,-,=f)=\qel—-p k=j-1 
0 otherwise 


which do not depend on n. Thus, a simple random walk X(n) is a homogeneous Markov chain. 


5.25. Show that for a Markov process X(t), the second-order distribution is sufficient to characterize 
X(t). 
Let X(t) be a Markov process with the nth-order distribution 
F(X X2, X45 lp £9, 5) = P{X(t,) € ху, X (tq) € x2, ..., Х() € x, 
Then, using the Markov property (5.26), we have 


Fy(Xi, X3... Xy Lys 09, 5 t) = PIX) € х„| X(t) S xy, X(t) € x5, ..., Xt, € x,-i) 
x P(X(t) € xy, Xt) € x3, ..., X(t,.)) € x41] 
= P{X(t,) < х„|Х(@„- |) < х„-1}Еу(Х\, ttt. Xn-14; lis 52] 1-1) 


Applying the above relation repeatedly for lower-order distribution, we can write 


Ех, X5, Xai tis £2, s Pn) = Рб tj) П Р(Х) < Xil X(t, 1) € xv-i] (5.104) 
к=2 
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Hence, all finite-order distributions of a Markov process can be completely determined by the second-order 
distribution. 


Show that if a normal process 1s WSS, then it is also strict-sense stationary. 


By Eq. (5.29), a normal random process X(t) is completely characterized by the specification of the 
mean E[X(t)] and the covariance function К x(t, s) of the process. Suppose that X(t) is WSS. Then, by Eqs. 
(5.21) and (5.22), Eq. (5.29) becomes 


V yup xov +, ©) = exp $ Y uo — = > Y K xlt; — - noon} (5.105) 
i= 1 k=1 
Now we translate all of the time instants ¢,, t3, ..., t, by the same amount т. The joint characteristic 
function of the new r.v.’s X(t; + т), = 1, 2, ..., n, is then 


V xt te xut 0) = zi Y uo; — У Kt + т —(t, + то; 2| 


i=l 2 ї= 1 k=1 
п 1 n 
= өр) Уро; — 2. Y Ky, ~ t), 2| 
i=. i=l k=l 
= VW yay xe o0, (5.106) 


which indicates that the joint characteristic function (and hence the corresponding joint pdf) is unaffected by 
a shift in the time origin. Since this result holds for any n and any set of time instants (t; є T, i= 1, 2, ..., n), 
it follows that if a normal process is WSS, then it is also strict-sense stationary. 


Let {X(t}, —oc «t < оо} be a zero-mean, stationary, normal process with the autocorrelation 
function 
|z| 


1-— -~-T<t<T 
Кт) = T 215 (5.107) 


0 otherwise 


Let {X(t,), i= 1, 2, ..., n) be a sequence of n samples of the process taken at the time instants 


T 
ais і = 1, 2,...,п 
Find the mean and the variance of the sample mean 
1 п 
Ё, = = Y X(t) (5.108) 
i=] 
Since X(t) is zero-mean and stationary, we have 
E[X(t)] = 0 
T 
and Ry(t;, t) = ELX(t)X(t,)] = Күп, — t) = 0 — i) a 
Thus Е(Ӣ,) = | X X(t | - i E[X(t)] 2 0 (5.109) 
and Уаг(й,) = Е{[2, — Е(Д„)]?} = E(A,”) 


- Js Exo: ix 


1 Ш n 1 п п 
ту LEXON = Y Y a —% 7] 


CHAP. 5] RANDOM PROCESSES 185 


By Eq. (5.107), 


1 k=i 
Кук – 97/2) = 41  |k-i|-1 
0 |-> 2 
Thus Var(ji,) = 5 [n(1) + 2(n — 1)(4) + 0] = E Qn — 1) (5.110) 


DISCRETE-PARAMETER MARKOV CHAINS 


5.28. Show that if P is a Markov matrix, then P" is also a Markov matrix for any positive integer n. 


Pi Piz Pim 
Let P= ҮЛ |P: Ра Pam 
Pmi Pm2 UU Рът 


Then by the property of a Markov matrix [Eq. (5.35)], we can write 


Pu Prz Pim ||! ! 

Dia. Рэз Pam || 1 - 1 

Prat Pmt Ps SL! ! 
or Pa=a (5.111) 
where а= (11 1 = 1] 


Premultiplying both sides of Eq. (5.711) by P, we obtain 
P?a = Pa =a 
which indicates that P? is also a Markov matrix. Repeated premultiplication by P yields 
P"a =a 


which shows that P” is also a Markov matrix. 


8.29. Verify Eq. (5.39); that is, 
p(n) = p(0)P" 


We verify Eq. (5.39) by induction. If the state of X, is i, state X, will be j only if a transition is made 
from i to j. The events (X, = i, i = 1, 2, ...) are mutually exclusive, and one of them must occur. Hence, by 
the law of total probability [Eq. (7.44)], 


P(X, = ) =}, Р(Х = Р(Х, =j|Xo =i) 
or рК) =E Pp; }=1,2,... (5.112) 
In terms of vectors and matrices, Eq. (5.112) can be expressed as 


p(1) = р(О)Р (5.113) 
Thus, Eq. (5.39) is true for n — 1. Assume now that Eq. (5.39) is true for n — k; that is, 


p(k) = p(0)P* 
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Again, by the law of total probability, 
Р(Х. =) = У Р(Х, = Р(Х, = HX =й) 


or РАК + l= Y p(p; =f =1,2,... (5.114) 


In terms of vectors and matrices, Eq. (5.114) can be expressed as 
p(k + 1) = p(k)P = p(0)P*P = p(0)P**' (5.115) 
which indicates that Eq. (5.39) is true for k + 1. Hence, we conclude that Eq. (5.39) is true for all n > 1. 


5.30. Consider a two-state Markov chain with the transition probability matrix 


1-а а 
p=| b un" 0б0<а<1,0<Ь<1 (5.116) 


(a) Show that the n-step transition probability matrix Р” is given by 


„__1 b а | 8 —a 
к=! tea-esp:cq бил 


(b Find P" when n > oo. 





(a) From matrix analysis, the characteristic equation of P is 

À — (1 — a) —a 
—b A—(1— b) 

={(4—1)(4—1+а+Ь)=0 


ca) =|А1—Р|= 





Thus, the eigenvalues of P are А, = | and 4; = 1 — a — b. Then, using the spectral decomposition 
method, P" can be expressed as 


Р" = A,"E, + АЕ, (5.118) 


where E, and ЕЁ, are constituent matrices of P, given by 


E, = 





[P — А, E; [P — AM] (5.119) 


ThA 


A, = 4, 


Substituting 4, = 1 and A, = 1 — а — Б in the above expressions, we obtain 


1 b а 1 a —a 
Е, = Е, =—— 
! zl ‘| ? zal М 


Thus, by Eq. (5.118), we obtain 
Р" = Е, + (1 – а – БРЕ, 


=- d^ "lia y| 7 5.120) 
Casbi|b a a -b b б. 
(b) IfO0<a<1,0<b<1,then0<1—a<1and|!—a-—b|<1.Solim,.,(1 — a — b = 0 and 


1 b a 
lim Р" = —— 512 
n b "| (5.421) 








Note that a limiting matrix exists and has the same rows (see Prob. 5.47). 


5.31. Ап example of a two-state Markov chain is provided by a communication network consisting of 
the sequence (or cascade) of stages of binary communication channels shown in Fig. 5-9. Here X, 
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=] l-b X =} 
Fig. 5-9 Binary communication network. 


denotes the digit leaving the nth stage of the channel and X, denotes the digit entering the first 
stage. The transition probability matrix of this communication network is often called the 
channel matrix and is given by Eq. (5.116); that is, 


l—a a 
r-l b i" 0<а<1,0<Ь<1 


Assume that a = 0.1 and b = 0.2, and the initial distribution is P(X, = 0) = P(X, = 1) = 0.5. 


(a) Find the distribution of X,. 
(b) Find the distribution of X, when n > oo. 


(a) The channel matrix of the communication network is 
9 Q. 
p- 09 0.1 
0.2 0.8 
and the initial distribution is 


p(0) = [0.5 0.5] 


By Eq. (5.39), the distribution of X, is given by 


09 O17 
p(n) = p(0)P^ = [0.5 eso M 


Letting a = 0.1 and b = 02 in Eq. (5.117), we get 








[9 s] ED [оз 0.1 | 01 —01 
lo: s] ~ 03 [оз ul 03 | —0.2 0.2 
© (0.7)" 1 — (0.7)" 
3 
2- n 2 ~ 200.7)" 1+ 1 + 200.7) 
3 
Thus, the distribution of Х, is 
24(07" 1 (0.7) 
Pin) = [0.5 0.5] 


2-207)" 1+ 200.7)" 
3 


.[2 (m 1, 027 
-f 6 SET 
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that is, 
2 (07у 1 (07у 
P(X, = 0) == – d РХ, = 1) == 
(Х, = 0) 3 6 an (X, = 1) 3 6 


(b) Since іт, .. „.(0.7)" = 0, the distribution of X, when n — oo is 


PX, = 0) =2 and Р(Х, = 1) = 


Verify the transitivity property of the Markov chain; that is, if i > j and j > k, then i К. 


By definition, the relations i — j and j э k imply that there exist integers n and m such that р,” > 0 
and p, > 0. Then, by the Chapman-Kolmogorov equation (5.38), we have 


pm = Y p, pi" > рур" > 0 (5.122) 


Therefore i > k. 


Verify Eq. (5.42). 


If the Markov chain [ X,) goes from state i to state j in m steps, the first step must take the chain from i 
to some state k, where k # j. Now after that first step to k, we have m — 1 steps left, and the chain must get 
to state j, from state k, on the last of those steps. That is, the first visit to state j must occur on the (m — 1)st 
step, starting now in state k. Thus we must have 

fif" = У Pa Gp m= 2, 3,... 


кё) 


Show that in a finite-state Markov chain, not all states can be transient. 


Suppose that the states are 0, 1, ..., т, and suppose that they are all transient. Then by definition, after 
a finite amount of time (say Tọ), state 0 will never be visited; after a finite amount of time (say 7,), state 1 
will never be visited; and so on. Thus, after a finite time T = max{To, T, ..., Ты}, no state will be visited. 
But as the process must be in some state after time T, we have a contradiction. Thus, we conclude that not 
all states can be transient and at least one of the states must be recurrent. 


A state transition diagram of a finite-state Markov chain is a line diagram with a vertex corre- 
sponding to each state and a directed line between two vertices i and j if pj; > 0. In such a 
diagram, if one can move from i and j by a path following the arrows, then i > j. The diagram is 
useful to determine whether a finite-state Markov chain is irreducible or not, or to check for 
periodicities. Draw the state transition diagrams and classify the states of the Markov chains 
with the following transition probability matrices: 


0 05 05 Н A °з °з 
(a) Р=|05 0 05 (b P= 
0.5 0.5 0 9 1 0 0 
0 1 0 0 
03 04 0 0 0.3 
0 1 0 0 0 
(с) P=] 0 0 0 06 04 
0 0 0 0 1 
0 0 1 0 0 
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(a) (5) (о) 


Fig. 5-10 State transition diagram. 


(а) The state transition diagram of the Markov chain with P of part (а) is shown in Fig. 5-10(a). From Fig. 
5-10(a), it is seen that the Markov chain is irreducible and aperiodic. For instance, one can get back to 
state 0 in two steps by going from 0 to | to 0. However, one can also get back to state 0 in three steps 
by going from 0 to 1 to 2 to 0. Hence 0 is aperiodic. Similarly, we can see that states 1 and 2 are also 
aperiodic. 

(b) The state transition diagram of the Markov chain with P of part (b) is shown in Fig. 5-10(b). From Fig. 
5-10(b), it is seen that the Markov chain is irreducible and periodic with period 3. 

(c) The state transition diagram of the Markov chain with P of part (c) is shown in Fig. 5-10(c). From Fig. 
5-10(с), it is seen that the Markov chain is not irreducible, since states 0 and 4 do not communicate, 
and state 1 is absorbing. 


5.36. Consider a Markov chain with state space (0, 1) and transition probability matrix 


n m 
vue © 
LILJ 


(a) Show that state 0 is recurrent. 
(b) Show that state 1 is transient. 
(a) By Eqs. (5.41) and (5.42), we have 
foo = Poo = 1 fio" = Pio =4 


foo” = роц fio” -(0)0 = 0 
foo” = 0 п> 2 


Then, by Eqs. (5.43), 
foo = Р(Т < о|Х, = 0) = Y foo" 21-00-21 
n=0 


Thus, by definition (5.44), state 0 is recurrent. 
(b) Similarly, we have 
fu? p, 71 foi? = ро = 0 
fir? = pio for? = (00 = 0 
Г" =0 n22 
апа fu = РТ < [Xp = 1) = У "е5 +0+0 += 51 
п= 0 


Thus, by definition (5.48), state 1 is transient. 
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5.37. Consider a Markov chain with state space {0, 1, 2} and transition probability matrix 


0 $ 1 
P-[1 0 0 
1 0 0 
Show that state 0 is periodic with period 2. 
The characteristic equation of P is given by 
A -3 7-3 
«4)2]41— P| = | —1 À 0 |= 42 -2=0 
—1 0 À 


0 + +10 2 3 1 0 0 
P°»=P=|]) 0 01 0 oļ=|0 ài + 
1 o ojlı o of [o 23 1 
0 3 3 
pr*oop-|i 0 0 
1 0 0 
Therefore 40) = ged{n > 1: poo > 0} = ged{2, 5, 6,...) = 2 


Thus, state 0 is periodic with period 2. 
Note that the state transition diagram corresponding to the given P is shown in Fig. 5-11. From Fig. 


5-11, it is clear that state 0 is periodic with period 2. 


hNI— 
nie 


Fig. 5-11 


5.38. Let two gamblers, A and B, initially have k dollars and m dollars, respectively. Suppose that at 
each round of their game, À wins one dollar from B with probability p and loses one dollar to B 
with probability g = 1 — p. Assume that A and B play until one of them has no money left. (This 
is known as the Gambler's Ruin problem.) Let X, be A's capital after round n, where n = 0, 1, 
2,...and Xo =k. 


(а) Show that X(n) = (X,, n > 0} isa Markov chain with absorbing states. 
(b) Find its transition probability matrix P. 
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(а) The total capital of the two players at all times is 
k+m=N 
Let 2, (n > 1) be independent r.v.’s with P(Z, = 1) = p and P(Z, = – 1) = ў = 1 — p for all n. 
Then 
X,=X,-,+Z, п= 1, 2,... 


and X, = К. The game ends when X, = 0 or X, = №. Thus, by Probs. 5.2 and 5.24, X(n) = {Х„,п 2 0} 
is a Markov chain with state space E = {0, 1, 2,..., N}, where states 0 and N are absorbing states. The 
Markov chain X(n) is also known as a simple random walk with absorbing barriers. 


(b) Since 
Piiri = Р(Х, =i + 1[X, =) = р 
Pi.i-1 = P(X,., =i-1[X,==q 
Pii = Р(Х, = 1X, = ф = 0 i#0, N 
Po.o = P(X,., =0|Х„ = 0) = 1 


Рм. м = Р(Х, .1 = МХ, = № = 1 


the transition probability matrix Р is 


1 0 0 0 
q p 0 0 
0 0 0 
Р = | : EN : (5.123) 
ооо о q 0 p 
0 0 0 0 0 0 1 


For example, when p = q = + and N = 4, 


10000 
20100 
Р=|0 4 0 1 0 
00 10 4 
00001 


5.39. Consider a homogeneous Markov chain X(n) = (X,, n = 0} with a finite state space E = (0, I, 
..., N}, of which A = {0, 1,..., mj, m > 1, is a set of absorbing states and B = {m + 1,..., N} is 
a set of nonabsorbing states. It is assumed that at least one of the absorbing states in A is 
accessible from any nonabsorbing states in B. Show that absorption of X(n) in one or another of 
the absorbing states is certain. 


If X, E A, then there is nothing to prove, since X(n) is already absorbed. Let X, e B. By assumption, 
there is at least one state in A which is accessible from any state in B. Now assume that state k € A is 
accessible from j є B. Let nj, (< со) be the smallest number n such that pj, > 0. For a given state j, let n; 
be the largest of n5, as k varies and п be the largest of n; as j varies. After n' steps, no matter what the initial 
state of X(n), there is a probability p > 0 that X(n) is in an absorbing state. Therefore 

P(X, € В} =1—p 
and 0 < 1 — p < 1. It follows by homogeneity and the Markov property that 


PEX een E В) -(1—p*  К=1,2,... 
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Now since lim,., ,(1 — р) = 0, we have 


lim P{X,¢ B] 20 ог lim P(X,€ B= A} = 1 


п» ш nc 


which shows that absorption of X(n) in one or another of the absorption states is certain. 


Verify Eq. (5.50). 


Let X(n) = (X,, n = 0} be a homogeneous Markov chain with a finite state space E = (0, 1,..., N}, of 
which A = (0, 1, ..., m, m > 1, is a set of absorbing states and B = (m + 1,..., N} isa set of nonabsorbing 
states. Let state k € B at the first step go to i e E with probability p,,. Then 

uy; = Р(Х, = j(e A)| Xo = ke B)) 


1 


N 
= Y pu P(X, = є А)| Xo = i} (5.124) 
i=1 
1 i=j 
Now P(X, =j(€ A, Xy = i] 240 іє A,i Fj 
и іє В,і= т + 1,..., № 


Then Eq. (5.124) becomes 


N 
uy m Dt È Puy k=m+1,..., N;j=1,... т (5.125) 


і=т+ 1 


But р, Кет + 1,..., N;j = 1, ..., m are the elements of К, whereas pi, k 2 m + 1,..., №і= т + 1,..., 
М are the elements of Q [see Ед. (5.49a)]. Hence, in matrix notation, Eq. (5.125) can be expressed as 


U=R+QU or (I—Q)U-R (5.126) 
Premultiplying both sides of the second equation of Eq. (5.126) with (I — Q)~', we obtain 
U-(I—-Q)!R-9R 


Consider a simple random walk X(n) with absorbing barriers at state 0 and state N = 3 (see 
Prob. 5.38). 


(a) Find the transition probability matrix P. 
(b) Find the probabilities of absorption into states 0 and 3. 


(a) The transition probability matrix P is [Eq. (5.123)] 


0 1 2 3 

011 0 0 0 
p=! q 0 p 0 
210 а 0 p 
3}0 0 0 1 


(b) Rearranging the transition probability matrix P as [Eq. (5.49a)], 


0 3 ] 2 
от 0 0 0 
3|0 1 0 о 
P= 
а 0 0 p 
210 p д Q 
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5.42. 


5.43, 


and by Eq. (5.49b), the matrices Q апа К are given by 


к=|г SEF И o- |^ 2-9 { 
20 P23 0 p P23 Pa q 0 


Il = 
Then i-o-| r] 





and Ф010) = — [) ] (5.127) 


By Eq. (5.50), 








1 fi 0 ? 
u- Hn M -R= | 1 Е | __! | р | (5.128) 
изо Из 1—pqalq !JLO p] 1-pqlq’ p 


Thus, the probabilities of absorption into state 0 from states | and 2 are given, respectively, by 


2 








ujo = and uzo = 


1 — pq 


and the probabilities of absorption into state 3 from states 1 and 2 are given, respectively, by 


| — pq 


2 





цуз = and Uz3 





1 — pq | l-pq 
Note that 
q+  1—p+p? 
що uy = I—M— = — ——-] 
05 71 t= pq — pl p) 
2 2 
+ +(1— 
ио + иу = 9121224 0-ф _, 


1—рд 1—(1-ф4_ 
which confirm the proposition of Prob. 5.39. 


Consider the simple random walk X(n) with absorbing barriers at 0 and 3 (Prob. 5.41). Find the 
expected time (or steps) to absorption when X, = 1 and when X, = 2. 


The fundamental matrix Ф of X(n) is [Eq. (5.1 27)] 


_ фу, Qiz _ 1 $ r] 
? [$ s] l—pqiq 1 


Let T, be the time to absorption when X, =i. Then by Eq. (5.51), we get 








1 
Е(Т,) = 1— pg (1 + p) Е(Т,) = (q+ 1) (5.129) 


1 — pq 


Consider the gambler’s game described in Prob. 5.38. What is the probability of A’s losing all his 
money? 


Let P(k), К = 0, 1, 2, ..., М, denote the probability that A loses all his money when his initial capital is 
k dollars. Equivalently, P(k) is the probability of absorption at state 0 when X, = К in the simple random 
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walk X(n) with absorbing barriers at states 0 and N. Now if 0 < k < N, then 
P(k) = рР(К+1)+4Р(К—1) k=1,2,...,N—1 (5.130) 


where pP(k + 1) is the probability that A wins the first round and subsequently loses all his money and 
qP(k — 1) is the probability that A loses the first round and subsequently loses all his money. Rewriting Eq. 
(5.130), we have 


1 
Pk + N= PUR) +Í Pk- 1) =0 k=1,2,...,N—1 (5.131) 


which is a second-order homogeneous linear constant-coefficient difference equation. Next, we have 
Р(0) = 1 апа P(N) = 0 (5.132) 


since if К = 0, absorption at 0 is a sure event, and if k = №, absorption at № has occurred and absorption at 
0 is impossible. Thus, finding P(k) reduces to solving Eq. (5.131) subject to the boundary conditions given 
by Eq. (5.132). Let P(k) = г. Then Eq. (5.131) becomes 


1 
prt in ky Tmt ш) ptq-l 
p p 


Setting k = | (and noting that p + q = 1), we get 


1 
r—-r+i=(r ц, 2) =o 
р р р 


from which we get r = 1 and r = q/p. Thus, 





k 
P(k) = e, + a(2) q*p (5.133) 


where c, and c; are arbitrary constants. Now, by Eq. (5.132), 


Р(0) = 1-с + = 1 
q N 
P(N) =0¢, ««(1) -0 


Solving for c, and c,, we obtain 





oz —(q/p)" os 
‘1 = (9/р)" * 1 = (a/p 
k N 
Hence P(k) = talpi — (8р). q*p (5.134) 
| — (4/р) 
Note that if N > k, 
1 а>р 
P(k) = (9 (5.135) 
- pdq 
р 
Setting r = q/p in Eq. (5.134), we have 
r* — rh k 
P(k) = -—1-— 
(9 ]p-r* cn N 
Thus, when p =q = 3, 
P(k) = 1 k (5.136 
= N . ) 


5.44. Show that Eq. (5.134) is consistent with Eq. (5.128). 
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5.48. 


Substituting k = 1 and N = 3 in Eq. (3.134), and noting that р + q = 1, we have 


(a/p) — (a/p _ ар? — 4?) 
1 —(q/p)’ (р? – 47) 





Р(1) = 


Now from Eq. (5.128), we have 


q 
= —— = Р(| 
Чуо 1 рд (1) 


Consider the simple random walk X(n) with state space E = (0, 1, 2, ..., N}, where 0 and N are 
absorbing states (Prob. 5.38). Let r.v. Т, denote the time (or number of steps) to absorption of 
X(n) when X, = k, k = 0,1, ..., N. Find Е(Т,). 


Let Y(k) = E(T,). Clearly, if k = 0 or k = N, then absorption is immediate, and we have 


Ү(0) = Y(N) 20 (5.137) 
Let the probability that absorption takes m steps when X, = k be defined by 
P(k, m) = P(T, = т) m=1,2,... (5.138) 
Then, we have (Fig. 5-12) 
P(k, m) = pP(k + 1, m — 1) + qP(k — 1, m— 1) (5.139) 
and Y(k) = ET) = У mP(k, т) = рУ трк +1, т 1) +4 У mP(k 1, m — 1) 
m=1 те | m-1 
Setting m — 1 — i, we get EU 


¥(k) = p Y (i+ Рк + 1, i) + qY (i+ DP(k — 1,9 
i=0 i=0 
Y iP — 1, i) + pY Plk + Lj) qY Pik- 1, i) 
i=0 i=0 i20 


= pYiP(k 1,09 +4 
г=0 


пй a рр 





0 1 2 3 k n 
Fig. 5-12 Simple random walk with absorbing barriers. 
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Now by the result of Prob. 5.39, we see that absorption is certain; therefore 


Y Pk +1, i) = Y Pk —1, i) = 1 


i20 i-0 
Thus Y(k) = pY(k - 1) + qYk —D- pq 
or Y(k) = pY(k + 1) + gYik— 1) +1 k=1,2,....N-1 (5.140) 
Rewriting Eq. (5.140), we have 
1 q 1 
Y(k + 1)—- Y(k) ++ Ү(К—1)=—- (5.141) 
p p p 


Thus, finding P(k) reduces to solving Eq. (5.141) subject to the boundary conditions given by Eq. (5.137). 
Let the general solution of Eq. (5.141) be 


Y(k) = Y(k) + ҮК) 


where Y,(k) is the homogeneous solution satisfying 
1 
Wk + )—-— Y) +1 Y(k ~ 1) =0 (5.142) 
and ҮК) is the particular solution satisfying 
1 4 1 
Y(k +1)—- Y() +- Yk- 1) = —- (5.143) 
р р р 
Let Y,(k) = ak, where a is a constant. Then Eq. (5.143) becomes 
1 1 
(Kk Va —- ka S (к 1a = —- 
p р р 


from which we get а = l/(q — p) and 


k 
Y,(k) = p#q (5.144) 
q—P 
Since Eq. (5.142) is the same as Eq. (5.131), by Eq. (5.133), we obtain 
q k 
ҮК) = с, + «() а#р (5.145) 
where c, and c, are arbitrary constants. Hence, the general solution of Eq. (5.141) is 
qY, k 
Y() = c, + a2) +—— q#p (5.146) 
p 4=p 


Now, by Eq. (5.137), 
Y(0)=O0->c,+c¢,=0 


"NON 
vi - 0 +42) +——=0 


Solving for c, and с, we obtain 


2o LN - р) c, 2 М9 Р) 
"1 = (9/р)“ 2 d = (q/p)® 
Substituting these values in Eq. (5.146), we obtain (for р з q) 
І t= ater |) 
Y(k) = ET) = —— | k — N| ——— 5.147 
(= Ah) = 2 | ig 647) 


When p = q = 5, we have 
Y(k) = ЕТ) = ҚМ —-k р=д= 5 (5.148) 


CHAP. 5] RANDOM PROCESSES 


5.46. Consider a Markov chain with two states and transition probability matrix 


5.47. 


(b) Now Р" = | 1 
i 


(a) Find the stationary distribution p of the chain. 
(b) Find lim,- Р". 


(a) By definition (5.52), 


0 I! 


or [Pi Z Е P2] 


which yields p, = р. Since p, + p; = 1, we obtain 


and іт, _.„ P" does not exist. 


Consider a Markov chain with two states and transition probability matrix 


| 


n al 


(a) Find the stationary distribution p of the chain. 
(b) Find lim,.,,, Р". 
(c) Find lim,- P" by first evaluating P". 


(a) By definition (5.52), we have 


3 1 
or [Pi Р] |} i = [ру Pi 
2 2 


which yields 
ih + 4p. =P, 


EP; + 3р = рз 


Each of these equations is equivalent to p, = 2p;. Since p, + p; = 1, we obtain 


B-[5 3] 


(b) Since the Markov chain is regular, by Eq. (5.53), we obtain 


whe Ы 


lim P" = lim | 


п no 


we ode 
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(c) Setting a = 4 and b = + in Eq. (5.120) (Prob. 5.30), we get 


NE po 
Р' = $ |-Ф] 2 | 
3 3 -3 3 


Since lim,..,, (2)" = 0, we obtain 


POISSON PROCESSES 


5.48. Let Т, denote the arrival time of the nth customer at a service station. Let Z, denote the time 
interval between the arrival of the nth customer and the (n — 1)st customer; that is, 


=D- nt (5.149) 


n=l 
and T, = 0. Let (X(t), t > 0} be the counting process associated with {Т,, n > 0}. Show that if 
X(t) has stationary increments, then Z,,n = 1, 2, ..., are identically distributed r.v.’s. 


We have 
P(Z, > 2) = 1 — P(Z, <2) =1— Е, (2) 
By Eq. (5.149), P(Z, > 2) = Р(Т,— T, > 2) = P(T, > Т, + 2) 
Suppose that the observed value of Т, _ у ist,-,. The event (Т, > T, + z|7,_, = („- у) occurs if and only if 


X(t) does not change count during the time interval (t - ү, t, ., + 2) (Fig. 5-13). Thus, 


P(Z, > 2|Т,- = 1,1) = Р(Т, > h-i +2|T,-; = 1,1) 
= P[X(t,_, + 2) — X(t,.,) = 0] 


ог P(Z, «z|T, =t,_,)=1 PEX, z) — Xt, 1) = 0] (5.150) 


Since X(t) has stationary increments, the probability on the right-hand side of Eq. (5.150) is а function only 
of the time difference z. Thus 





Р(2, < 2]1,-1=1,-1) = 1 — P[XG) = 0] (5.151) 


which shows that the conditional distribution function on the left-hand side of Ед. (5.151) is independent of 
the particular value of n in this case, and hence we have 


F,(2) = P(Z, < 2) = 1 — P[X(z) = 0] (5.152) 


which shows that the cdf of Z, is independent of п. Thus we conclude that the Z,’s are identically distrib- 
uted r.v.'s. 


Fig. 5-13 


5.49. Show that Definition 5.6.2 implies Definition 5.6.1. 
Let p,(t) = P[X(t) 2 n]. Then, by condition 2 of Definition 5.6.2, we have 


polt + At) = P[X(t + At) = 0] = PLX() = 0, X(t + At) — X(0) = 0] 
= P[X(t) = O]P[X(t + М) — X(t) = 0] 
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Now, by Eq. (5.59), we have 
P[X(t + Ar) - X(t) 20] = 1 — 4 At + ofAd 


Thus, Polt + At) = po(O[1 — A At + o(At)] 
polt + At) роб) _ ом) 
ог ea = ер) + 5 


Letting At — 0, апа by Eq. (5.58), we obtain 


pot) = —Apolt) (5.153) 
Solving the above differential equation, we get 
polt) = Ке! 
where k is an integration constant. Since р,(0) = Р[Х(0) = 0] = 1, we obtain 
pot) = е“ (5.154) 


Similarly, for п > 0, 

palt + At) = Р[Х + At) = n] 

= P[X(t) =n, X(t + Ar) - X(0) = 0] 
+ P[X() 2 n — 1, X(t + An — X(0 = 1] + У PLX(Q =n к X(t + At) ~ X(0) = К] 
k=2 
Now, by condition 4 of Definition 5.6.2, the last term in the above expression is o(At). Thus, by conditions 2 
and 3 of Definition 5.6.2, we have 
рї + At) = p EL — A At + o(At)] + p, ,(t)[A At + o(At)] + o( Ad) 


Ap = Ap,(t) + Ар, - I(t) + At 


Thus 

and letting At > 0 yields 
Palt) + Ар!) = Ap, 1(0) (5.155) 

Multiplying both sides by e^, we get 


eMLpi(t) + APD] = дер, s(t) 
d, | 
Непсе di [ep t] = Ae" p, — y(t) (5.156) 
Then by Eq. (5.154), we have 
d, 
5 E" ri = a 


or р) = Qt + ce ^^ 
where c is an integration constant. Since p,(0) = Р[Х(0) = 1] = 0, we obtain 
р) = Ate ^" (5.157) 
To show that 
рф) = e " xà 


we use mathematical induction. Ássume that it is true for n — 1; that is, 


a (Ar^! 


Pa- = е (п = 1)! 
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Substituting the above expression into Eq. (5.156), we have 


нн 1 


(n — 1)! 





d At — 
di [e"p(0] = 


Integrating, we get 





; At 
e“'p,(t) = ) + 1 
n! 
Since p,(0) = 0, c, = 0, and we obtain 
„(А 
pt) =e“ ar (5.158) 


which is Eq. (5.55) of Definition 5.6.1. Thus we conclude that Definition 5.6.2 implies Definition 5.6.1. 


Verify Eq. (5.59). 
We note first that X(t) can assume only nonnegative integer values; therefore, the same is true for the 
counting increment X(t + At) — X(t). Thus, summing over all possible values of the increment, we get 


Y P[X(t + Ar) - X(t) = К] = P[X(t + AD — X(t) = 0] 
k=0 


+ P[X(t + At) - X(t) = 1] + PLX(t + At) — X(t) > 2] 
= 1 
Substituting conditions 3 and 4 of Definition 5.6.2 into the above equation, we obtain 


PLX(t + At) — X(t) =0] = 1 — å At + o(Ar) 


(a) Using the Poison probability distribution in Eq. (5.758), obtain an analytical expression for 
the correction term o(At) in the expression (condition 3 of Definition 5.6.2) 


P[X(t + At) - X(t) = 1] = A At + o(At) (5.159) 
(b) Show that this correction term does have the property of Eq. (5.58); that is, 
am at 
(a) Since the Poisson process X(t) has stationary increments, Eq. (5.159) can be rewritten as 
P[X(At) = 1] = p,(At) = A At + o(At) (5.160) 
Using Eq. (5.158) [or Eq. (5.157)], we have 


р(А) = À Ate?“ = А Atl + e774" — 1) 
= À At + À Аце 1) 


Equating the above expression with Eq. (5.160), we get 
A At + o(At) = å At + A At(e ^^! — 1) 


from which we obtain 


o(At) = А At(e ^ ^ — 1) (5.161) 
(b) From Eq. (5.161), we have 
At A At(e ^^ — 1 
lim 20 па AE Dom Xe? 1) =0 
ao Al Ar^ 0 At ato 


5.52. Find the autocorrelation function R,(t, s) and the autocovariance function K x(t, s) of a Poisson 


process X(t) with rate 2. 
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5.53. 


5.54. 


5.55. 


From Eas. (5.56) and (5.57), 
E[X(t)] = at Var[X(t)] = at 


Now, the Poisson process X(t) is a random process with stationary independent increments and X(0) = 0. 
Thus, by Eq. (5.103) (Prob. 5.23), we obtain 


K,(t, s) = o,? min(t, s) = А min(t, s) (5.162) 
since су? = Var[ X(1)] = А. Next, since E[ X(t)] ELX(s)] = A?ts, by Eq. (5.10), we obtain 
Ry(t, 5) = А min(t, s) + А215 (5.163) 


Show that the time intervals between successive events (or interarrival times) in a Poisson 
process X(t) with rate А are independent and identically distributed exponential r.v.’s with 
parameter A. 


Let Z,, Za, ... be the r.v.'s representing the lengths of interarrival times in the Poisson process X(t). 
First, notice that {Z, > t) takes place if and only if no event of the Poisson process occur in the interval 
(0, t), and thus by Eq. (5.154), 


P(Z, > t) = P{X() = 0) 2e 
or Fz) = Z, st)=1— e7” 
Hence Z, is an exponential г.у. with parameter A [Eq. (2.49)]. Let f(t) be the pdf of Z,. Then we have 


P(Z,>0= fee: > t{Z, = т) (т) ат 
= | PLX(t + t) — X(t) = 0] f(t) ат 


=e * [ле dv-e (5.164) 


which indicates that Z, is also an exponential r.v. with parameter А and is independent of Z,. Repeating the 
same argument, we conclude that Z,, Z,,... are iid exponential r.v.'s with parameter А. 


Let T, denote the time of the nth event of a Poisson process X(t) with rate 4. Show that T, is a 
gamma r.v. with parameters (n, 4). 


Clearly, 
L-Z tZ tZ, 
where Z,, n = 1, 2, ... , are the interarrival times defined by Eq. (5.149). From Prob. 5.53, we know that Z, 


are iid exponential r.v.’s with parameter A. Now, using the result of Prob. 4.33, we see that T, is a gamma 
r.v. with parameters (n, 4), and its pdf is given by [ Eq. (2.76)]: 


At^! 
-a (At) (20 


749 = ^ (n- D! (5.165) 
0 t «0 





The random process {T}, n > 1} is often called an arrival process. 


Suppose t is not a point at which an event occurs in a Poisson process X(t) with rate A. Let W(t) 
be the r.v. representing the time until the next occurrence of an event. Show that the distribution 
of W(t) is independent of t and W(t) is an exponential r.v. with parameter 4. 


Let 5 (0 € s « t) be the point at which the last event [say the (n — 1)st event] occurred (Fig. 5-14). The 
event { W(t) > т} is equivalent to the event 


{Z, >t—s+t|Z,>t—s} 
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Thus, using Eq. (5.764), we have 


P[W(t) > т] = P(Z, > t-s + t|Z, >t- 83) 
P(Z,»t—s-cco) e 0M 
PIZ,>t—s) ew» 


and P[W(t) < т] = 1 — e* (5.166) 





e ^ 


which indicates that W(t) is an exponential r.v. with parameter 4 and is independent of t. Note that W(t) is 
often called a waiting time. 


5.56. Patients arrive at the doctor's office according to a Poisson process with rate А = 15 minute. The 
doctor will not see a patient until at least three patients are in the waiting room. 


(a) Find the expected waiting time until the first patient is admitted to see the doctor. 
(b) What is the probability that nobody is admitted to see the doctor in the first hour? 
(a) Let T, denote the arrival time of the nth patient at the doctor's office. Then 
Л,=2+27,+ +Z, 
where Z,, n = 1, 2, ... , are iid exponential г.у with parameter 4 = т. By Eqs. (4.108) and (2.50), 


ET) = p 2) = Y E(Z) =n - (5.167) 


The expected waiting time until the first patient is admitted to see the doctor is 
E(T3) = 3(10) = 30 minutes 
(b) Let X(t) be the Poisson process with parameter 4 = 15. The probability that nobody is admitted to see 
the doctor in the first hour is the same as the probability that at most two patients arrive in the first 60 
minutes. Thus, by Eq. (5.55), 
P[X(60) — X(0) < 2] = P[X(60) — X(0) = 0] + P[X(60) — X(0) = 1] + PLX(60) — X(0) = 2] 
= g 60/10 + e 69/100) + e %91101090)2 


=e (1 +6 + 18) = 0.062 


5.57. Let T, denote the time of the nth event of a Poisson process X(t) with rate А. Suppose that one 
event has occurred in the interval (0, t). Show that the conditional distribution of arrival time T, 
is uniform over (0, t). 
Fort xt, 
PIT, <1, X(t) = 1] 
P[X(t) = 1] 
_ P[X(x) = 1, X(t) — Х(т) = 0] 
i P[X(t) = 1] 
. PLX(t) = !]P[X(t) — X(t) = 0] 
i P[X(t) = 1] 
Ate "eg 170 


Ate #! 


PLT, x«X()-1]—- 


(5.168) 


эра 


which indicates that T, is uniform over (0, t) [see Eq. (2.45)]. 


CHAP. 5] RANDOM PROCESSES 203 


5.58. 


Consider a Poisson process X(t) with rate A, and suppose that each time an event occurs, it is 
classified as either a type 1 or a type 2 event. Suppose further that the event is classified as a type 
1 event with probability p and a type 2 event with probability 1 — p. Let X ,(t) and X,(t) denote 
the number of type 1 and type 2 events, respectively, occurring in (0, t). Show that (X (1), t > 0} 
and {X,(t), t = 0) are both Poisson processes with rates Ap and A(1 — p), respectively. Further- 
more, the two processes are independent. 


We have 

X(t) = X(t) + X(t) 

First we calculate the joint probability PLX ,(t) = k, X(t) = m]. 

P[X (t) = k, X(t) = m] = У PLX,(r) = k, X(t) = m| X(t) = п]Р[Х() = n] 

n=0 

Note that 

P[X,(t) = k, X,(t) = m| X(t) =n] =0 whenn#k+m 
Thus, using Eq. (5.158), we obtain 
PLX (0) = k, X(t) = m] = PLX (0 = k, Ху) = m} X() = k  m]PLX() = k + т] 


( jtm 


= P[X,(t) =k, X;(t) = m| X(t) =k+mJe* 





(К + m)! 


Now, given that k + m events occurred, since each event has probability p of being a type 1 event and 
probability 1 — p of being a type 2 event, it follows that 


РІХ (0) = k, X(t) = m| X(t) =k +m] = ( т "y — р)" 


k+m 
k 


Thus PLX (t) =k x40 =m) =( уко — pyre 977. 

, 1 » 42 P P (k + m)! 
(+ т)! noa (ahem 
“йт PU "6" (К+ m)! 


=e tet (Арг) g a-p ГАО — р)" 


k! m! 








(5.169) 


Then P[X,() =k) = У P[X,(0 =k, X,(t) = т] 
m-i 
-ap (АРОК цари = LAO — pr" 
= Apt {1—ри 
^ p] X m! 
-ap APE Lia pia - 
Lg» e AG ptg ái ph 
k! 
_ e- ap (Apt) 


m (5.170) 


which indicates that X (t) is a Poisson process with rate Ap. Similarly, we can obtain 


РХ) = т] = È Р(Х) = k Xs) = т] 


1 
= g ^ -pi [AQ = p)t}" 
m! 


(5.171) 


and so X,(t) is a Poisson process with rate A(1 — p). Finally, from Eqs. (5.170), (5.171), and (5.169), we see 
that 
РХ (0) = k, X(t) = т] = Р[Х (0) = K]P[X (t) = m] 


Hence, X (t) and X (t) are independent. 
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5.59, 


5.60. 


Let X,, ..., X, be jointly normal r.v.'s. Show that the joint characteristic function of X}, ..., X, 
is given by 


. n 1 n n 
Ҹу... x0 +++ On) = sx 2. 0, 4; = У У оь) (5.172) 
і=1 i=1 k51 
where u; = E(X;) and o; = Cov(X;, Ху). 
Let Y¥Y=a,X,+a,X,+-:-+4,X, 


By definition (4.50), the characteristic function of Y is 
Yo) = E[gietXi tax» = Vx, u х,а}, ++, Wap) (5.173) 


Now, by the results of Prob. 4.55, we see that Y is a normal r.v. with mean and variance given by [Eqs. 
(4.108) and (4.111)] 


Hy = E(Y) = У a; E(X) = Y аи, (5.174) 
i=1 i=l 


ay? = Var(Y)= Y. Y аа, Cow(X,, X) = Y. Y аас, (5.175) 
i=] k=l i=1k=1 
Thus, by Eq. (4.125), 
Yw) = ехр[ јоу — 30,707] 


= exp( jo 3 au —io Y У аа, ru) (5.176) 
i=1 i=l k=l 
Equating Eqs. (5.176) and (5.173) and setting о = 1, we get 
п 1 п п 
VY yis xa 2) = 6) di Hi — 2 У У аа в) 
= 1 i=} k=l 


By replacing a,’s with œs, we obtain Eq. (5.172); that is, 


п 1 п п 
Чү, x0, s ©) = exo( i Xom 2 У Yao, в) 
i=1 i 


Bi €, 911. 77 Oin 
Let w=]: о = | : К = [03] = | : 
Hn C, On Onn 
Then we can write 
У co, By = eg Y Va eon = "Ко 


i=l i=1 k=1 
and Eq. (5.172) can be expressed more compactly as 


V yu 00-5 On) = expl jo — 3o" Ko) (5.177) 


Let X,,..., X, be jointly normal r.v.'s Let 
Y, — au X, + ttt + a,,X, 
: (5.178) 
Ү = Ami X y tc + Onn X, 
where a, (i = 1,...,m;j=1,..., п) are constants. Show that Y,, ..., Ү, are also jointly normal 
r.V.'s. 
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5.61. 


5.62. 


X, Y, ау c а, 


тп, 


Then Eq. (5.178) can be expressed as 
Y = AX (5.179) 


Hy ©, Fir 777 Oin 
Let ux = E(X) =] : о= |: Kx = [oa] = 


Hn Om, 2,1 UST Onn 


Then the characteristic function for Ү can be written as 
Ф401, ..., Op) = E(e/?" Y) = Elei) 
= E[eX4*9"X] = py Ata) 
Since X is a normal random vector, by Eq. (5.177) we can write 
Vx(A* 0) = exp[j(A7@) иу — 3(A7@)"Ky(A7@)] 
= exp[ ja’ Ану — 4o" AK, А70] 

Thus V (o, ..., On) = exp(jo" n, — io^ Kyo) (5.180) 
where Hy = Ану Ky = AK, A" (5.181) 


Comparing Eqs. (5.177) and (5.180), we see that Eq. (5.180) is the characteristic function of a random vector 
Y. Hence, we conclude that Y,,..., Y, are also jointly normal r.v.’s 

Note that on the basis of the above result, we can say that a random process { X(t), t e T] is a normal 
process if every finite linear combination of the r.v.’s X(t,), t; € T is normally distributed. 


Show that a Wiener process X(t) is a normal process. 


Consider an arbitrary linear combination 


Y a, XU) = a, X(t,) + ay X(t) + а, X(t,) (5.182) 
i=1 


where 0 < t, <. < t, and а, are real constants. Now we write 
"n 

a; X(t) = (a, + +++ + а) [Хх — X(0)] + (а, ++ + а, СХ) — X(6)] 
=1 


(asa + OLX (ty 1) — X] + aX) – X(t, 1) (5.183) 


Now from conditions 1 and 2 of Definition 5.7.1, the right-hand side of Eq. (5.183) is a linear combination 
of independent normal r.v.’s. Thus, based on the result of Prob. 5.60, the left-hand side of Eq. (5.183) is also 
a normal r.v.; that is, every finite linear combination of the r.v.'s X(t;) is a normal r.v. Thus we conclude that 
the Wiener process X(t) is a normal process. 


A random process ( X(t), t e T] is said to be continuous in probability if for every e > Oand t e T, 


lim P{| X(t + h) — X(t)] > e] = 0 (5.184) 


h—0 
Show that a Wiener process X(t) is continuous in probability. 
From Chebyshev inequality (2.97), we have 


PIX( & hy — XQ)» 6) s SAEED А0) к> 
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5.64, 


5.65. 


5.66. 


5.67. 
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Since X(t) has stationary increments, we have 
Var[X(t + А) — X(t)] = Var[X(A)] = o?h 


in view of Eq. (5.63). Hence, 


2 
lim P(1X( Ю — X(0| > e] = lim 77 = 0 


h20 h20 


Thus the Wiener process X(t) is continuous in probability. 


Supplementary Problems 


Consider a random process X(n) = (X,, n > 1}, where 
X, =Z, *Z 4 Z, 
and Z, are iid r.v.'s with zero mean and variance o?. Is X(n) stationary? 


Ans. No. 


Consider a random process X(t) defined by 


X(t) = Y cos(wt + ©) 


[CHAP 5 


where Y and Ө are independent r.v.’s and are uniformly distributed over ( — A, A) and (—z, л), respectively. 


(a) Find the mean of X(t). 
(b) Find the autocorrelation function Rx(t, 5) of X(t). 


Ans. (a E[X(0)]-0; (Б) Ry(t, 5) = 4A? cos a(t — s) 


Suppose that a random process X(t) is wide-sense stationary with autocorrelation 


Ry(t, t + q) = e I2 


(a) Find the second moment of the г.у, X(5). 
(b) Find the second moment of the г.у. X(5) — X(3). 


Ans (a) E[X']e 1; (b) Е([Х(5) — X(3)]?} = 20 — e?) 


Consider a random process X(t) defined by 


X(t) = О cos г + (И + Dsint — 00 <1< 00 


where U and V аге independent r.v.’s for which 
E(U) = E(V) = 0 E(U?) = EV?) = 1 
(a) Find the autocovariance function K (t, s) of X(t). 
(b) Is X(t) WSS? 
Ans. (а) Ky(t, 5) = cos(s — t); (b) No. 


Consider the random processes 


X(t) = Ag cos(@gt + ©) Y(t) = A, cos(o,t + Ф) 


where Ao, A,, wy, and о, are constants, and r.v.’s © and Ф are independent and uniformly distributed over 


(—т, л). 
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5.68. 


5.69. 


5.70. 


5.71. 


5.72. 


5.73. 


5.74. 


(a) Find the cross-correlation function of Ry,(t, t + т) of X(t) and Y(t). 
(b) Repeat (a) if © = ф. 
Ans. (a) Ryy;(t,¢+7)=0 


А, A 
(b Ryt, 6+ т) —— 





cos[(o, — 00) + wt) 


Given a Markov chain ( X,, n > 0}, find the joint pmf 
Р(Х 2ig, X, = і, ..., Х, = 6) 
Hint: Use Eq. (5.32). 
Ans. Pi Pis Pius UC Di, 
Let (X,, n > 0} be a homogeneous Markov chain. Show that 
Р(Х, = К... Xnam = Kml Xo = io ey Х, = 0) = Р(Х, = К,,..., Xn = Kyl Xo =) 
Hint: Use the Markov property (5.27) and the homogeneity property. 
Verify Eq. (5.37). 


Hint: Write Eq. (5.39) in terms of components. 


Find Р" for the following transition probability matrices: 


ШЕ 1 0 0 1 0 0 
(a) P- [os os] b) P=|0 1 0 (О P=|0 1 0 
7 0 01! 03 02 05 


! 0 0 0 0 0 
© Р=| 0 1 ol|«(sy| o 0 0 
06 04 0 -06 -04 1 


A certain product is made by two companies, A and B, that control the entire market. Currently, A and B 
have 60 percent and 40 percent, respectively, of the total market. Each year, A loses 4 of its market share to 
B, while B loses 5 of its share to A. Find the relative proportion of the market that each hold after 2 years. 


Ans. A has 43.3 percent and B has 56.7 percent. 


Consider a Markov chain with state {0, 1, 2} and transition probability matrix 


0 4 1i 
P-|4 0 4 
1 0 0 


Is state 0 periodic? 
Hint: Draw the state transition diagram. 


Ans. No. 


Verify Eq. (5.51). 
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5.75. 


5.76. 


5.77. 


5.78. 


5.79. 


5.80. 


5.81. 


5.82. 
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Hint: Let = [N 4], where Ny, is the number of times the state К(є B) is occupied until absorption takes 
place when X(n) starts in state j(e B). Then Т, = У. „1 Ny; calculate E(N p). 


Consider a Markov chain with transition probability matrix 


06 02 02 
Р=|04 05 01 
06 0 04 


Find the steady-state probabilities. 

Ans. р= [5 ô $] 

Let X(t) be a Poisson process with rate 4. Find E[ X (r)]. 
Ans. AL Ar? 


Let X(t) be a Poisson process with rate 4. Find E([X(t) — X(s)]^) for t > s. 
Hint: Use the independent stationary increments condition and the result of Prob. 5.76. 


Ans. A(t— s) + A(t У)? 


Let X(t) be a Poisson process with rate 4. Find 


PEXU- d =k Xj] d>0 
^ 967 
7 ку ки\ r JNM 


Let Т, denote the time of the nth event of a Poisson process with rate 4. Find the variance of T,. 


Ans. n/À 


Assume that customers arrive at a bank in accordance with a Poisson process with rate A = 6 per hour, and 
suppose that each customer is а man with probability 2 and a woman with probability 4. Now suppose 
that 10 men arrived in the first 2 hours. How many woman would you expect to have arrived in the first 2 
hours? 


Ans. 4 
Let X,,.... X, be jointly normal r.v.’s. Let 

Y = Х, +С; i= J,...,n 
where c; are constants. Show that Y,,..., Y, are also jointly normal r.v.’s. 


Hint: See Prob. 5.60. 


Derive Eq. (5.63). 
Hint: Use condition (1) of a Wiener process and Eq. (5.102) of Prob. 5.22. 


Chapter 6 


Analysis and Processing of Random Processes 


61 INTRODUCTION 


In this chapter, we introduce the methods for analysis and processing of random processes. First, 
we introduce the definitions of stochastic continuity, stochastic derivatives, and stochastic integrals of 
random processes. Next, the notion of power spectral density is introduced. This concept enables us 
to study wide-sense stationary processes in the frequency domain and define a white noise process. 
The response of linear systems to random processes is then studied. Finally, orthogonal and spectral 
representations of random processes are presented. 


6.2 CONTINUITY, DIFFERENTIATION, INTEGRATION 


In this section, we shall consider only the continuous-time random processes. 


A. Stochastic Continuity: 
A random process X(t) is said to be continuous in mean square or mean square (m.s.) continuous if 


lim E((X(t + е) — X(0]?] = 0 (6.1) 
£20 
The random process X(t) is m.s. continuous if and only if its autocorrelation function is continuous 
(Prob. 6.1). If X(t) is WSS, then it is m.s. continuous if and only if its autocorrelation function Кт) is 
continuous at т = 0. If X(t) is m.s. continuous, then its mean is continuous; that is, 


lim py(t + £) = ux(t) (6.2) 
£20 
which can be written as 
lim E[ X(t + в)] = Ет X(t + &)] (6.3) 
£0 r20 


Hence, if X(t) is m.s. continuous, then we may interchange the ordering of the operations of expecta- 
tion and limiting. Note that m.s. continuity of X(t) does not imply that the sample functions of X(t) 
are continuous. For instance, the Poisson process is m.s. continuous (Prob. 6.46), but sample func- 
tions of the Poisson process have a countably infinite number of discontinuities (see Fig. 5-2). 


B. Stochastic Derivatives: 


А random process X(t) is said to have a m.s. derivative X'(t) if 


. X(t +e) — X(t) 
].i.m. — 


z20 


= X(t) (6.4) 


where l.i.m. denotes limit in the mean (square); that is, 


2 
lim je xa — xo] | =0 (6.5) 
20 


£ 
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The m.s. derivative of X(t) exists if 22А,(1, s)/ét 05 exists (Prob. 6.6). If X(t) has the m.s. derivative 
X(t), then its mean and autocorrelation function are given by 


d 


E[X'(t)] = di E[X(t)] = ux(t) (6.6) 
O?Ry(t, s) 
Ry(t, 5) = has (6.7) 


Equation (6.6) indicates that the operations of differentiation and expectation may be interchanged. If 
X(t) is a normal random process for which the m.s. derivative X'(t) exists, then X'(t) is also a normal 
random process (Prob. 6.10). 


C. Stochastic Integrals: 


A m.s. integral of a random process X(t) is defined by 


t 
Y(t) = [хы da = lim. У X(t) At; (6.8) 
tọ Au>0 i 
where tọ < t; << t and At; = 1; — fj. 
The m.s. integral of X(t) exists if the following integral exists (Prob. 6.11): 


[ f R,x(a, B) da dB (6.9) 


о vio 


This implies that if X(t) is m.s. continuous, then its m.s. integral Y(t) exists (see Prob. 6.1). The mean 
and the autocorrelation function of Y(t) are given by 


uy(t) = {| [хо а | = ахы da = [Ге da (6.10) 
Ry(t, s) = {| [xo da [xo | 
= [ [хохи dB da = f [1 p) 4B da (6.11) 


Equation (6.10) indicates that the operations of integration and expectation may be interchanged. If 
X(t) is a normal random process, then its integral Y(t) is also a normal random process. This follows 
from the fact that X, X(t;) At; is a linear combination of the jointly normal r.v.’s. (see Prob. 5.60). 


63 POWER SPECTRAL DENSITIES 


In this section we assume that all random processes are WSS. 


A. Autocorrelation Functions: 


The autocorrelation function of a continuous-time random process X(t) is defined as [Eq. (5.7)] 


Ry(1) = Е[Х(@Х( + т)] (6.12) 

Properties of R(t): 
1. Ry(—t) = Кт) (6.13) 
2. | Rx(t)| < К,(0) (6.14) 


3. К,(0) = E[X()] 2 0 (6.15) 
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Property 3 [Eq. (6.15)] is easily obtained by setting т = 0 in Eq. (6.12). If we assume that X(t) is a 
voltage waveform across a 1-Q resistor, then E[X?(t)] is the average value of power delivered to the 
1-Q resistor by X(t). Thus, E[X?(t)] is often called the average power of X(t). Properties 1 and 2 are 
verified in Prob. 6.13. 

In case of a discrete-time random process X(n), the autocorrelation function of X(n) is defined by 


КЮ) = E[X(n)X(n + Ю] (6.16) 


Various properties of К, (К) similar to those of R(t) can be obtained by replacing т Бу k in Eqs. (6.13) 
to (6.15). 


B. Cross-Correlation Functions 


The cross-correlation function of two continuous-time jointly WSS random processes X(t) and 
Y(t) is defined by 


Ryy(t) = E[X()Y(t + т)] (6.17) 

Properties of Ry,(1): 
1. Куүу(—т) = Ryx(t) (6.18) 
2. |Ryy(1)| € / Rx(0)R x0) (6.19) 
3. |Ryy()] € 3[Ку(0) + Кү(0)] (6.20) 


These properties are verified in Prob. 6.14. Two processes X(t) and Y(t) are called (mutually) orthog- 
onal if 


Ryy(1) = 0 for all т (6.21) 


Similarly, the cross-correlation function of two discrete-time jointly WSS random processes X(n) and 
Y(n) is defined by 


Ryy(k) = ELX(n)Y(n + k] (6.22) 


and various properties of Ry ,(k) similar to those of Ry,(x) can be obtained by replacing т by k in Eqs. 
(6.18) to (6.20). 


C. Power Spectral Density: 


The power spectral density (or power spectrum) $ (о) of a continuous-time random process X(t) is 
defined as the Fourier transform of R(t): 


Sw) = f Ry{t)e~* dc (6.23) 
Thus, taking the inverse Fourier transform of Sy(w), we obtain 
1 [* | 
Ry(1) = эл | $ (о)е?°®* do (6.24) 


Equations (6.23) and (6.24) are known as the Wiener-Khinchin relations. 


Properties of 50): 
1. $у(ф) is real and S,(@) > 0. (6.25) 
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2. $у(—) = So) (6.26) 


3. ELX(t)) = Ry(0) -> [ Salo) до (6.27) 


Similarly, the power spectral density S,(Q) of a discrete-time random process X(n) is defined as the 
Fourier transform of R x(k): 


SQ) = y Кет 99 (6.28) 


к= – о 


Thus, taking the inverse Fourier transform of 5 (О), we obtain 


Ry(k) = E [ Sx(Q)e/?* dQ (6.29) 

Properties of SQ): 
1. Sy(Q + 2n) = $у(О) (6.30) 
2. Sy(Q) is real and S,(Q) > 0. (6.31) 
3. Sy(—Q)-— SQ) (6.32) 
4. E[X?(n] = R,(0) = E [ SX(Q) dQ (6.33) 


Note that property 1 [Eq. (6.30)] follows from the fact that e~/* is periodic with period 2л. Hence it 
is sufficient to define $,(Q) only in the range (—z, x). 
D. Cross Power Spectral Densities: 


The cross power spectral density (or cross power spectrum) Sxy(œ) of two continuous-time random 
processes X(t) and Y(t) is defined as the Fourier transform of Ry,(1): 


Syy(@) = [` Ryylt)e і" dt (6.34) 


-00 


Thus, taking the inverse Fourier transform of 5 ;у(о), we get 


Ry,(t) = .- l Suo)" do (6.35) 


oc 


Properties of Sy): 


Unlike S,(@), which is a real-valued function of c, 5уу(о), in general, is a complex-valued func- 
tion. 


1. Syy(@) = Syx(—@) (6.36) 
2. Sxy(—@) = S¥y(@) (6.37) 


Similarly, the cross power spectral density $уу(О) of two discrete-time random processes X(n) and 
Y(n) is defined as the Fourier transform of Ryy(k): 


500) = У К (Юе ™ (6.38) 
к= – о 
Thus, taking the inverse Fourier transform of Sy,(Q), we get 


Ryy(k) = = | SADAY dO (6.39) 
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Properties of Syy(Q): 


Unlike Sy(O), which is a real-valued function of о, Sy,(Q), in general, is a complex-valued func- 
tion. 


1. Syy(Q + 2л) = Sy (O) (6.40) 
2. Syy(O) = Syx(—O) (6.41) 
3. Sy —Q) = St,(Q) (6.42) 


6.4 WHITE NOISE 


A continuous-time white noise process, W(t), is a WSS zero-mean continuous-time random 
process whose autocorrelation function is given by 


Ку(т) = о2д(1) (6.43) 
where б(т) is a unit impulse function (or Dirac 6 function) defined by 
| (t)P(t) dt = ф(0) (6.44) 
where ф(т) is any function continuous at т = 0. Taking the Fourier transform of Eq. (6.43), we obtain 
Sylo) = о? | ó(te I dt = о? (6.45) 
which indicates that X(t) has a constant power spectral density (hence the name white noise). Note 
that the average power of W(t) is not finite. 


Similarly, a WSS zero-mean discrete-time random process W(n) is called a discrete-time white noise 
if its autocorrelation function is given by 


КК) = с?д(К) (6.46) 
where 6(k) is a unit impulse sequence (or unit sample sequence) defined by 
1 k=0 
k)- . 
ó(k) ‘0 k #0 (6.47) 
Taking the Fourier transform of Eq. (6.46), we obtain 
Sy{Q) = 0? Y Ke M* = o° —n<Q<n (6.48) 
к= – о 


Again the power spectral density of И (и) is a constant. Note that $,(Q + 2л) = S,(Q) апа the 
average power of W(n) is c? = Var[W(n)], which is finite. 


65 RESPONSE OF LINEAR SYSTEMS TO RANDOM INPUTS 
A. Linear Systems: 


A system is a mathematical model of a physical process that relates the input (or excitation) 
signal x to the output (or response) signal y. Then the system is viewed as a transformation (or 
mapping) of x into y. This transformation is represented by the operator T as (Fig. 6-1) 


y=Tx (6.49) 


If x and y are continuous-time signals, then the system is called a continuous-time system, and if x 
and y are discrete-time signals, then the system is called a discrete-time system. If the operator T is a 
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x y 
System 
T 


Fig. 6-1 


linear operator satisfying 
T{x, + xj = Tx, + Tx, = у + у, (Additivity) 
T{ax} = aTx = ay (Homogeneity) 


where « is a scalar number, then the system represented by T is called a linear system. A system is 
called time-invariant if a time shift in the input signal causes the same time shift in the output signal. 
Thus, for a continuous-time system, 


T {x(t — t9) = y(t — to) 
for any value of tọ, and for a discrete-time system, 
T{x(n — ng)] = y(n — nj) 


for any integer ng. For a continuous-time linear time-invariant (LTI) system, Eq. (6.49) can be 
expressed as 


y(t) = | h(A)x(t — А) da (6.50) 
where h(t) = Т{б()} (6.51) 
is known as the impulse response of a continuous-time LTI system. The right-hand side of Eq. (6.50) is 
commonly called the convolution integral of h(t) and x(t), denoted by A(t) » x(t). For a discrete-time 
LTI system, Eq. (6.49) can be expressed as 


ym = Y хп – i) (6.52) 


where h(n) = T{d(n)} (6.53) 


is known as the impulse response (or unit sample response) of a discrete-time LTI system. The right- 
hand side of Eq. (6.52) is commonly called the convolution sum of h(n) and x(n), denoted by h(n) * x(n). 


B. Response of a Continuous-Time Linear System to Random Input: 


When the input to a continuous-time linear system represented by Eq. (6.49) is a random process 
(X(0, ге Т,), then the output will also be a random process { Y(t), t e T,}; that is, 


T(X(t, te T,} = (Y(), t € T} (6.54) 
For any input sample function x,(t), the corresponding output sample function is 
уќ) = Tix()) (6.55) 


If the system is LTI, then by Eq. (6.50), we can write 


Ү() = | кух — a) dà (6.56) 


Note that Eq. (6.56) is a stochastic integral. Then 


a 


Е[Ү(] = | A(A)ELX(t — А] dà (6.57) 
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The autocorrelation function of Y(t) is given by (Prob. 6.24) 


Ry(t, s) = | | h(a)A(B)R x(t — a, s — B) da df (6.58) 
If the input X(t) is WSS, then from Eq. (6.57), 


E[Y()] = ux F ҚА) dà = ux H(0) (6.59) 


where H(0) = H(o)|,-9 and H(w) is the frequency response of the system defined by the Fourier 
transform of h(t); that is, 


Н(о) = F h(t)e і dt (6.60) 
The autocorrelation function of Y(t) is, from Eq. (6.58), 
Ry(t, 5) = F |" коник, — t +a — fl) da df (6.61) 
Setting s = t + c, we get 
Ryt t +1)= F [` h(x)h(f)R y(t + a — f) da dB = Кү(х) (6.62) 


From Eqs. (6.59) and (6.62), we see that the output Y(t) is also WSS. Taking the Fourier transform of 
Eq. (6.62), the power spectra! density of Y(t) is given by (Prob. 6.25) 


Sy(@) = [` К (т)е 7 дт = | Н(о) | Sw) (6.63) 


Thus, we obtain the important result that the power spectral density of the output is the product of the 
power spectral density of the input and the magnitude squared of the frequency response of the system. 

When the autocorrelation function of the output Ry(t) is desired, it is easier to determine the power 
spectral density Sy(w) and then evaluate the inverse Fourier transform (Prob. 6.26). Thus, 


Ry) = Е l Shoei do = - | |H) |?Sy(w)e* до (6.64) 


By Eq. (6.15), the average power in the output Y(t) is 


© 


E[Y*()] = Ry(0) = = { | H(w) |28 (о) dw (6.65) 


C. Response of a Discrete-Time Linear System to Random Input: 


When the input to a discrete-time LTI system is a discrete-time random process X(n), then by Eq. 
(6.52), the output Y(n) is 


Y(n) = y h(i)X(n — i) (6.66) 


i=-@ 


The autocorrelation function of Y(n) is given by 


Ryn, т) = Y Y ОРОК. y(n — i, m — 1D) (6.67) 


i-—o l=—« 
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When X(n) is WSS, then from Eq. (6.66), 


ELY(n)] = ux >, №) = ux H(0) (6.68) 


{= – о 


where Н(0) = Н(О)| д6 and H(Q) is the frequency response of the system defined by the Fourier 
transform of h(n): 


HQ) = y h(n)e 4" (6.69) 
The autocorrelation function of Y(n) is, from Eq. (6.67), 
Ry(n, m) = p X МВК ут —n- i—1) (6.70) 
Setting m =n + k, we get 
къп = Y X ООК + i — D = Ry(k) (6.71) 


From Eas. (6.68) and (6.71), we see that the output Y(n) is also WSS. Taking the Fourier transform of 
Eq. (6.71), the power spectral density of Y(n) is given by (Prob. 6.28) 


SQ) = |Н(О) 25,0) (6.72) 
which is the same as Eq. (6.63). 


6.6 FOURIER SERIES AND KARHUNEN-LOEVE EXPANSIONS 
A. Stochastic Periodicity: 
A continuous-time random process X(t) is said to be m.s. periodic with period T if 
E{[X(t + T) — X()]?) = 0 (6.73) 


If X(t) is WSS, then X(t) is m.s. periodic if and only if its autocorrelation function is periodic with 
period T; that is, 


Кут + T) = Ry() (6.74) 


B. Fourier Series: 


Let X(t) be a WSS random process with periodic К (т) having period T. Expanding R,(t) into a 
Fourier series, we obtain 


oo 


Кут) = Y ce" Фо = 2n/T (6.75) 
] (7 А 
where 6 = т | Ку(т)е "2% dt (6.76) 
0 
Let X(t) be expressed as 
£= У Хе Wo = 2n/T (6.77) 


n= – о 
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where X, are r.v.’s given by 
1 ft А 
X= = | X(te 7° dt (6.78) 
T Jo 
Note that, in general, X, are complex-valued r.v.’s. For complex-valued r.v.’s, the correlation between 


two г.у’ X and Y is defined by E(X Y*). Then X(t) is called the m.s. Fourier series of X(t) such that 
(Prob. 6.34) 


E{|X() — X@)|?} =0 (6.79) 
Furthermore, we have (Prob. 6.33) 
Hx n-0 
E X = = А 
(X,) = ux д(п) |н п 20 (6.80) 
с п=т 
E(X, X*) = с,д(п– т) = +" . 
(X, X7) — c,ó(n — m) f ném (6.81) 
C. Karhunen-Loéve Expansion 
Consider a random process X(t) which is not periodic. Let X(t) be expressed as 
X()= у X elt O<t<T (6.82) 
n-i 
where a set of functions {@,(t)} is orthonormal on an interval (0, T) such that 
T 
| $.(t)en(t) dt = à(n — т) (6.83) 
0 
and X, are r.v.'s given by 
T 
Х, = { X (tor (t) dt (6.84) 
0 


Then X(t) is called the Karhunen-Loéve expansion of X(t) such that (Prob. 6.38) 
E{| X(t) — X()/?} 20 (6.85) 


Let R(t, s) be the autocorrelation function of X(t), and consider the following integral equation: 


{ "кү, 3)$,(s) ds =A pd O<t<T (6.86) 


where 4, and $,(t) are called the eigenvalues and the corresponding eigenfunctions of the integral 
equation (6.86). It is known from the theory of integral equations that if Ry(t, 5) is continuous, then 
@,(t) of Eq. (6.86) are orthonormal as in Eq. (6.83), and they satisfy the following identity: 


уб, s) = Ў 1 6404209 (6.87) 


which is known as Mercer's theorem. 
With the above results, we can show that Eq. (6.85) is satisfied and the coefficient X, are orthog- 
onal r.v.'s (Prob. 6.37); that is, 


An n=m 


0 n*m (6.88) 


E(X, X*) = 4,6(n — т) = | 
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6.7 FOURIER TRANSFORM OF RANDOM PROCESSES 
A. Continuous-Time Random Processes: 
The Fourier transform of a continuous-time random process X(t) is a random process X(w) given 
by 
X(o) = | X(t)e 1 dt (6.89) 


which is the stochastic integral, and the integral is interpreted as an m.s. limit; that is, 





sia — | ° X(t)e і dt | =0 (6.90) 


-= 


Note that (ол) is a complex random process. Similarly, the inverse Fourier transform 
1 [® ; 
Ха) = | Х(соо)е!°®! do (6.91) 
2n jJ. 


is also a stochastic integral and should also be interpreted in the m.s. sense. The properties of 
continuous-time Fourier transforms (Appendix B) also hold for random processes (or random 
signals). For instance, if Y(t) is the output of a continuous-time LTI system with input X(t), then 


Y(o) = X(w)H(w) (6.92) 


where H(w) is the frequency response of the system. 
Let Ry(w,, о) be the two-dimensional Fourier transform of R y(t, s); that is, 


Rx(o,, @2) = r F Ry(t, 5)е 91 +29 dt ds (6.93) 
Then the autocorrelation function of (о) is given by (Prob. 6.41) 
Rx(@,, @2) = E[X(w,)X*(w2)] = Ry(v,, —@.) (6.94) 
If X(t) is real, then 
E[X(v,)X(o;)] = xw, @2) (6.95) 
X(—o) = X*(w) (6.96) 
Ry(—w@,, –0,) = Rt(o,, w2) (6.97) 


If X(t) is a WSS random process with autocorrelation function Rx(t, s) = Ry(t — s) = Ry{t) and power 
spectral density S,(w), then (Prob. 6.42) 


Rw, @2) = 20S w lw + о,) (6.98) 
Rz(o,, 95) = 2n8y(o,)0(c, — c) (6.99) 


Equation (6.99) shows that the Fourier transform of a WSS random process is nonstationary white 
noise. 


B. Discrete-Time Random Processes: 
The Fourier transform of a discrete-time random process X(n) is a random process X(Q) given by 
(in m.s. sense) 


XQ) = y Х(п)е i» (6.100) 


a 
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Similarly, the inverse Fourier transform 
I f" | 
Х(и) = on | X (e dO (6.101) 
should also be interpreted in the m.s. sense. Note that X(Q + 2л) = X(Q) and the properties of 


discrete-time Fourier transforms (Appendix B) also hold for discrete-time random signals. For 
instance, if Y(n) is the output of a discrete-time LTI system with input X(n), then 


Y(0) = X(0)H(Q) (6.102) 


where H(Q) is the frequency response of the system. 
Let R(Q, О,) be the two-dimensional Fourier transform of Ry(n, m): 


R,(Q,, Q,) = У У Ry(n, me” "+ 92m (6.103) 
Then the autocorrelation function of X(Q) is given by (Prob. 6.44) 
R(Q, Q2) = E[X(Q,)X*Q,)) = R,(Q,, -9,) (6.104) 
If X(n) is а WSS random process with autocorrelation function R,(n, т) = Ry(n — т) = R,(k) and 
power spectral density $,(Q), then 
RAQ, Qa) = 2n$,(0,)6(O, + О,) (6.105) 
Rg(Q,, Q5) = 218,(0,)0(Q, — Ә,) (6.106) 


Equation (6.106) shows that the Fourier transform of a discrete-time WSS random process is nonsta- 
tionary white noise. 


Solved Problems 


CONTINUITY, DIFFERENTIATION, INTEGRATION 
61. Show that the random process X(t) is m.s. continuous if and only if its autocorrelation function 

Ry(t, s) is continuous. 

We can write 
Е Хи + £) — X(0]?] = ELX?( +) — 2X(t + 2X(0 + X^()] 
= Ry(t +e, t €) — 2Ry(t + є, t) + Ry(t, t) (6.107) 
Thus, if R x(t, s) is continuous, then 
lim E([X(t + £) — X()]?) = lim {Ry(t + £, t €) - 2Ry(t + e, t) + Күй, 0] =0 


270 c0 
and X(t) is m.s. continuous. Next, consider 
Rylt + в, £+ £2) — Ry( t) = ELL X(t 6) — X(JEXG + €2) — X()]j 
+ E([XG 6) — X(0]X(0J + ЕХО + е) — X(01X(0J 
Applying Cauchy-Schwarz inequality (3.97) (Prob. 3.35), we obtain 
Ry(t + €r t+ 63) — Куй.) < (ЕХО 6) — XWP YE(LX U + е5) — XOP p'? 
+ ЕХ + e1) — X(N EDX?0)'?  GE(EXG + е) — X(OY JEEDX?(0D?? 
Thus if X(t) is m.s. continuous, then by Eq. (6.1) we have 
lim Ry(t +£, t +e) – Куп.) = 0 


£1, £270 


that is, Rx(£, s) is continuous. This completes the proof. 
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6.4. 


6.5. 
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Show that a WSS random process X(t) is m.s. continuous if and only if its autocorrelation 
function R(t) is continous at t = 0. 


If X(t) is WSS, then Eq. (6.107) becomes 
E{LX(t + е) — X())) = 26,00) — Rxle)] (6.108) 
Thus if К, (т) is continuous at т = 0, that is, 


lim [R y(e) — Ry(0)] = 0 


2-0 


then lim E{LX(t + £) — X(]?) = 0 


e-0 


that is, X(t) is m.s. continuous. Similarly, we can show that if X(t) is m.s. continuous, then by Eq. (6.108), 
R,(t) is continuous at т = 0. 


Show that if X(t) is m.s. continuous, then its mean is continuous; that is, 


lim ux(t + £) = ux(t) 


£0 
We have 
Var[X(t + e) — ХО] = E([X(t + £) - X(0]?) — (E[XG + е) — X(0]? 20 
Thus E([X( + в) — X())) > {Е[Х(ї + в) — ХО)? = Ги + £) — wy)? 


If X(t) is m.s. continuous, then as г — 0, the left-hand side of the above expression approaches zero. Thus 
lim [ux(t + e)— udt) 20 ог lim [ux(t + е) = uyt) 
Show that the Wiener process X(t) is m.s. continuous. 
From Eq. (5.64), the autocorrelation function of the Wiener process X(t) is given by 
Кт, 5) = o? min(t, s) 
Thus, we have 
|Ry(t + 6, t + €) — К, t)| = 6?| min(t + €,, £ + £3) — 1] < с? max(e,, £2) 


Since lim max(e,, &) = 0 
tj, €2^20 


Ry(t, 5) is continuous. Hence the Wiener process X(t) is m.s. continuous. 


Show that every m.s. continuous random process is continuous in probability. 
A random process X(t) is continuous in probability if, for every t and a > 0 (see Prob. 5.62), 


lim P(| X(t + е) — X(t)| > a} =0 
c-0 
Applying Chebyshev inequality (2.97) (Prob. 2.37), we have 
,EUXe + e) - XP) 


= a? 





P{| XU + е) — X(t) > а} 


Now, if X(t) is m.s. continuous, then the right-hand side goes to 0 as e 0, which implies that the left-hand 
side must also go to 0 as ғ 0. Thus, we have proved that if X(t) is m.s. continuous, then it is also 
continuous in probability. 


Show that a random process X(t) has a m.s. derivative X(t) if O^ Ry(t, 5)/0: д5 exists at s = t. 
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6.7. 


Let Y(t; в) = Хи + X +6) = хи) 


By the Cauchy criterion (see the note at the end of this solution), the m.s. derivative X (t) exists if 


lim (Уб: е) — Y(t; €,)]?} =0 


£j. 652—700 
Now E([Y(t; e3) — Y(t; )]^] = E[Y?(t; €;) — 2Y(t; e)Y(t; e) + Y?(t; є,)] 
= E[Y?(t; e;)] — 2E[Y(t; e) Y(t; €] + ELY (t: €,)] 


and — E[Y(s еу eJ] = — E(X( + e) — XOJUX( + e) — XO) 


1 
= ——[Ку( + £2, t 6) — Ry(t + £3, 0 — Ry(t, + €,) + Ry(t, 0] 





15 
1 {К 6, t+ 6) — Ry(t e, — Rylt, t +e) – Ку, 0) 
ё) & £i 
CHRy(t, s 
Thus lim E[Y(t; e)Y(t; e] = CAMS =R, 
£1.6270 UOS |=. 


provided 2?R z(t, s)/@t s exists at s = t. Setting e, = £, in Eq. (6.112), we get 


lim E[Y?(t; є,)] = lim ЕГУ; €,)] = А, 


6,70 £270 
and by Eq. (6.111), we obtain 
lim E([Ytt; e) — Y(t; e)]?] = Е, -2R; + К, = 0 


£j,t220 
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(6.109) 


(6.110) 


(6.111) 


(6.112) 


(6.113) 


Thus, we conclude that X(t) has a m.s. derivative X'(t) if 0? R,(t, s}/ðt д5 exists at s = t. If X(t) is WSS, then 


the above conclusion is equivalent to the existence of 0? R,(1)/0?« at т = 0. 


Note: In real analysis, a function g(c) of some parameter e converges to a finite value if 


lim [g(e,) — g(e,)] = 0 


£y, 02720 


This is known as the Cauchy criterion. 


Suppose a random process X(t) has a m.s. derivative X(t). 


(a) Find E[X'(t)]. 
(b) Find the cross-correlation function of X(t) and X'(t). 
(c) Find the autocorrelation function of X'(t). 


(а) Wehave 


E[X'(] = Eti d 


е0 E 
= lim E 
е0 

li Hy(t + в) — дуй) 
= [im —————— ——— 
€ 


0 


E 


[2 + ғ) — xe] 


= ut) 


(6.114) 
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(b) From Eq. (6.17), the cross-correlation function of X(t) and X'(t) is 


Куу, 5) = EFX(0X(S)] = e| ха) Li. m. erac 








c0 
. E[X(0X(s + e)] — E[X(0)X(s)] 
= lim 7 
є—0 
—ш RES HE — Ré s) _ OR alt, 3) (6.115) 
20 ё Os 
(c) Using Eq. (6.115), the autocorrelation function of X(t) is 
Ry(t, 5) = E[X'(0X'()] = lim. хаа) 
£0 
.. ELX(t  9X'(5)] — ELX(OX'(3)] 
-lim ———————————————— 
c0 E 
— lim Ryx(t + & 5) — Ryxlt, $) 
270 E 
OR yx (t, 5) _ O^ Ry(t, s) 
с 80 ддз (6.116) 
If X(t) is a WSS random process and has a m.s. derivative X'(t), then show that 
d 
(а) Ry (0) = 47 Rett) (6.117) 
T 
d 
(0) К,4(т) = — т? R,(t) (6.118) 


(a) For a WSS process X(t), Ry(t, s) = Ry(s — t). Thus, setting s — t = t in Eq. (6.115) of Prob. 6.7, we 
obtain ORy(s — t)/0s = dR,(t)/dt and 


dR 
Ryy (t, t +1) = Куут) = 20) 
dt 
(b Now OR,(s — t)/ðt = —dR,(t)/dt. Thus, 0? Ry(s — 0)/0: 0s = —d?Ry(t)/dt?, and by Eq. (6.116) of Prob. 
6.7, we have 
2 
Ry(t, t+ 1) = Ret) = — 75 Rule) 


Show that the Wiener process X(t) does not have a m.s. derivative. 


From Eq. (5.64), the autocorrelation function of the Wiener process X(t) is given by 


2 


| o?s 1> 5 
Ry(t, s) = o? min(t, s) = ^ les 
д в? {> 5 
T TR =o7u(t — $) = . 
hus 5; КЎ = оці) 1L. (6.119) 


where u(t — s) is a unit step function defined by 


1 t>s 
(735719. pes 


and it is not continuous at s = t (Fig. 6-2). Thus 3?R x(t, 5)/дї ds does not exist at s = t, and the Wiener 
process X(t) does not have a m.s. derivative. 
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6.10. 


6.11. 


u(t — 5) 


0 s D 
Fig. 6-2 Shifted unit step function. 


Note that although a m.s. derivative does not exist for the Wiener process, we can define a generalized 
derivative of the Wiener process (see Prob. 6.20). 


Show that if X(t) is a normal random process for which the m.s. derivative X'(t) exists, then X'(t) 
is also a normal random process. 


Let X(t) be a normal random process. Now consider 


X(t +8) — X(t) 
€ 


Y(t) = 


Then, n r.v.’s Y(t), Yb), Y,(t,) are given by a linear transformation of the jointly normal r.v.'s X(t), 
X(t, + £), X(t), X(t. + £), ..., X(t,), X(t, + £). It then follows by the result of Prob. 5.60 that Y,(t,), Y(t;), 
..., ¥(t,) are jointly normal r.v.’s, and hence Y(t) is a normal random process. Thus, we conclude that the 
m.s. derivative X'(t), which is the limit of Y(t) as € 0), is also a normal random process, since m.s. con- 
vergence implies convergence in probability (see Prob. 6.5). 


Show that the m.s. integral of a random process X(t) exists if the following integral exists: 


| | [Rut В) da df 


À m.s. integral of X(t) is defined by [Eq. (6.8)] 


Үп) = [x da = lim. Y. X(t) At; 


° An-Ó i 


Again using the Cauchy criterion, the m.s. integral Y(t) of X(t) exists if 


2 
lim 2p X(t) At, У X(t) sn] | =0 (6.120) 
i k 


Ап, An -—*0 


As in the case of the m.s. derivative [Eq. (6.7 1 1)], expanding the square, we obtain 
2 
ely ха) At; У X(t) an] | 
i k 


= у у ХОХ) At; At, + YY XG)X(t) At; At, — 2 Y Y. XX) At, an| 
i k i k i k 
-Y Y Rallis t) At; Ay + у Е, t) At; At, — 2 У Куц, t) Ati At 
i k i k i k 
and Eq. (6.120) holds if 


lim Y Y Ry(t; t) At; At, 


Atp AlO i k 
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exists, or, equivalently, 


f Ге, В) da df 


exists. 


6.12. Let X(t) be the Wiener process with parameter o7. Let 


(a) 
(b) 
(a) 


Y(t) = [xo da 


o 
Find the mean and the variance of Y(t). 
Find the autocorrelation function of Y(t). 


By assumption 3 of the Wiener process (Sec. 5.7), that is, E[ X(t)] = 0, we have 


E[Y()] = d (хе a] = [etx da =0 
0 o 


Then Уаг[Ү()] = E[Y?()] = f ['epxenxun da df 
0 JO 


г t 
- | | Ry(a, В) da dB 
о JO 
By Eq. (5.64), Ry(a, B) = а? min(a, f); thus, referring to Fig. 6-3, we obtain 


Var[ Y(t)] = о? f f min(«, В) da df 
0 JO 


‹ B t a c^t? 
=o? [ap Гавно а Гра 
o o о о 3 


Y(t) = [xo da + [oc — X(s)] da + (1 — s)X(s) 


Lett» s > 0 апа write 


= Y(s)+ [те — X(s)] da + (t — s)X(s) 


Then, fort > s 0, 


Куй, з) = ELY()Y()] 
= E(Y*()] + | E{CX(a) — X(s)}¥(9)} da + (t — SELX(GY(G)] 


s 





[CHAP 6 


(6.121) 


(6.122) 


(6.123) 
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Now by Eq. (6.122), 


2.3 


ELY%3)] = Var ¥(9)] = —7- 
Using assumptions 1, 3, and 4 of the Wiener process (Sec. 5.7), and since s < a < /, we have 


[stia — X(s)] ¥(s)} da = [ахо — X(s)] [xo ap} da 
0 


5 d 


= [ [a0 — X(s)|LX(B) — X(0)]} 48 da 
s O0 


= [ [ахы — X(s)] ELX(s) — X(0)] dB da = 0 


0 


Finally, ого < $ < s, 


(t — s)ELX(s) Y(s)] = (t — 5) | E[X(s)X(B)] ав 
0 
=(t-s) [кв В dB =(t 5) fo min(s, f) 4B 
0 o 


i s s? 
=o7(t—s) | BdB=o%(t—s) = 
o 2 


Substituting these results into Eq. (6.123), we get 


2.3 2 1 
Ry(t, 5) = > + с: s) z = 090: 3) 


Since Ry(t, s) = Ry(s, t), we obtain 


ho*s*(3t—s) t>s2>0 
icU8s—0) s>t20 


Куй, 5) = { 


POWER SPECTRAL DENSITY 
6.13. Verify Eqs. (6.13) and (6.14). 


From Eq. (6.12), 
Ryx(t) = E[X(0X(t + т)] 


Setting г + т = s, we get 


Ry(t) = E[X(s — 1)X(s)] = ELX(s)X(s — т)] = К-т) 


Next, we have 


E(LX() + Х@ + т)]?} 20 


Expanding the square, we have 


or 


E[X"(t) + 2X()X(t +t) + Xr+] 20 
E[X?()) + 2ECX(OX(t + 1)] + ELX? + 0] > 0 


Thus 2R,(0) + 28 у(х) > 0 


from which we obtain Eq. (6.14); that is, 


R x(0) > | К,(т)| 


6.14. Verify Eqs. (6.18) to (6.20). 
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(6.124) 


226 ANALYSIS AND PROCESSING OF RANDOM PROCESSES (CHAP 6 


By Eq. (6.17), 
Ryy( 71) = ELX(N Y(t — т)] 
Setting t — t = s, we get 
Ryy(—1) = E(X(s + t)¥(s)] = ELY(s)X(s + т)] = Ry x(t) 
Next, from the Cauchy-Schwarz inequality, Eq. (3.97) (Prob. 3.35), it follows that 
{ЕХО + 01)? x Е[Х(О]Е[ГҮЗ(@ + 0] 
or СК, „(0]? < К,(0)К,0) 
from which we obtain Eq. (6./9); that is, 
|Ryxt)| < RAORAO 
Now E{[X(t) — Y(t + 9]) > 0 
Expanding the square, we have 
ELX) — 2X()Y(t + т) + Ү?г 0] 20 
or E[X^(0] — 2E[X()Y(t + 9] + E[Y?(t + т) = 0 
Thus R4(0) — 2Ryxy(t) + R,() > 0 
from which we obtain Eq. (6.20); that is, 
Куут) < SCR (0) + R,()] 


6.15. Two random processes X(t) and Y(t) are given by 
X(t) = А cos(ot + Ө) — Y(t) = A sin(ot + Ө) 


where А and о are constants and © is a uniform r.v. over (0, 2л). Find the cross-correlation 
function of X(t) and Y(t) and verify Eq. (6.18). 


From Eq. (6.17), the cross-correlation function of X(t) and Y(t) is 
Ryy(t, t + т) = Е[Х(ӨҮ( + 2)] 
= E{A? cos(ot + ©) sin[o(t + т) + ©]} 
А? 
=> E[sin(2ot + wt + 20) — $їп( — оот)] 
А? 
=F sin wr = Ry) (6.125) 
Similarly, 
Ryx(t, t + t) = E[Y(0X(t + т)] 
= E(A? sin(wt + ©) cos[ælt + т) + 9]) 
А? 
= E[sin(2cot + от + 20) + 5іп( — от)] 


А? 
= - sin wt = Ry x(t) (6.126) 


From Eqs. (6.125) and (6.126), we see that 


A? A? 
Куу(—т) = > sin @(—t) = — > sin от = Ry y(t) 


which verifies Eq. (6.18). 


6.16. Show that the power spectrum of a (real) random process X(t) is real and verify Eq. (6.26). 
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6.17. 


From Eq. (6.23) and expanding the exponential, we have 


So) = [Г Ry(x)e ~? dc 


-0 


It 


{ Ry(tXcos от — j sin от) dt 


-‹< 


M 


— 0 


{ Ку(т} cos от dt — j { R(t) sin ox dt (6.127) 


Since R4(—:) = R,(t), Кү(т) cos wt is an even function of t and Ку(т) sin от is ап odd function of t, and 
hence the imaginary term in Eq. (6.127) vanishes and we obtain 


S,(@) = { Кт) cos от dt (6.128) 
which indicates that 5 (о) is real. Since cos( — от) = cos(w7), it follows that 
Sy(-- @) = 5(0) 


which indicates that the power spectrum of a real random process X(t) is an even function of frequency. 


Consider the random process 
Y(t) = (—1)*® 


where X(t) is a Poisson process with rate A. Thus Y(t) starts at Y(O) = 1 and switches back and 
forth from +1 to —1 at random Poisson times T;, as shown in Fig. 6-4. The process Y(t) is 
known as the semirandom telegraph signal because its initial value Y(0) = 1 is not random. 


(a) Find the mean of Y(t). 
(b) Find the autocorrelation function of Y(t). 


(a) We have 


Yo = 1 if X(t) is even 
—1 if X(t) is odd 


Thus, using Eq. (5.55), we have 


P[Y(t) = 1] = P[X(t) = even integer] 
(at)? 


= e + ot | = e^" cosh At 


P[Y(t) = —1] = P[X(t) = odd integer] 
(at)? 


— eu + ЕТИ + | =e sinh И 





Fig. 6-4 Semirandom telegraph signal. 


228 


6.18. 


6.19. 


ANALYSIS AND PROCESSING OF RANDOM PROCESSES [CHAP 6 


Hence By(t) = ЕГУ] = (0PLY() = 1] c (C UPLY() = —1] 
=e “(cosh At — sinh 4t) = e^?? (6.129) 
(b) Similarly, since Y(t)¥(t+ т) = 1 if there are an even number of events in (t, г + т) for т> 0 and 
Ү()Ү( + т) = —1 if there are an odd number of events, then for t > Oand t +t » 0, 
Ry(t, t +7) = E[Y(OY( + 0] 
ас AD)" а, (At)" 
= 1 Av \_ —] rio 
ШАКА m 
w) —À n . . . 
= ey! = e Ate Ач = e 7^ 
which indicates that R,(t, + т) = Ry(x), and by Eq. (6.13), 
Кут) = e 219 (6.130) 


Note that since E[ Y(1)] is not a constant, Y(t) is not WSS. 


Consider the random process 
Z(t) = AY(t) 


where Y(t) is the semirandom telegraph signal of Prob. 6.17 and A is a r.v. independent of Y(t) 
and takes on the values +1! with equal probability. The process Z(t) is known as the random 
telegraph signal. 


(a) Show that Z(t) is WSS. 
(b) Find the power spectral density of Z(t). 
(a) Since E(A) = 0 and E(A?) = 1, the mean of Z(r) is 
Bt) = E[Z(0] = К(А)ЕГҮй)] = 0 (6.131) 
and the autocorrelation of Z(t) is 
Rot, t + т) = E[A?Y()Y(t + т)] = E(A)ECY()Y(t + 0] = Күй, ¢ т) 
Thus, using Eq. (6./30), we obtain 
Rat, t + т) = Rt) = et (6.132) 


Thus, we see that Z(t) is WSS. 


(b) Taking the Fourier transform of Eq. (6.132) (see Appendix B), we see that the power spectrum of Z(t) is 
given by 


(6.133) 


Slo) = —4 
A= ога 


Let X(t) and Y(t) be both zero-mean and WSS random processes. Consider the random process 
Z(t) defined by 


Z(t) = X(t) + Y(t) 
(a) Determine the autocorrelation function and the power spectral density of Z(t), (i) if X(t) and 
Y(t) are jointly WSS; (ii) if X(t) and Y(t) are orthogonal. 


(b) Show that if X(t) and Y(t) are orthogonal, then the mean square of Z(t) is equal to the sum 
of the mean squares of X(t) and Y(t). 


(a) The autocorrelation of Z(t) is given by 
Куй, 5) = ELZ()Z(s)] = Е{[Х(@) + Y(OICX(s) + Ү()]} 
= E[X(0)X(5)] + ELX(QY(s)] + ELY(OX(s)] + EDY(OY(S)] 
= Ry(t, s) + Ryylt, 5) + Ryy(t. 5) + Rgt, 5) 
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(i) If X(r) and Y(t) are jointly WSS, then we have 
Кдт) = Ry(t) + Ryy(t) + Ryy(t) + Кү(т) 
where т = s — t. Taking the Fourier transform of the above expression, we obtain 
5да) = Sx{w) + $уу(ф) + Sym) + Sylw) 
Gi) If X(t) and Y(t) are orthogonal [Eq. (6.2/)], 
Ryy(t) = Куут) = 0 
Then Кдт) = Ry{t) + Кү(т) (6.134a) 
Sz(o) = $у(ш) + Sw) (6.1345) 
(b) Setting т = 0 in Eq. (6.134a), and using Eq. (6.15), we get 
E[Z^()] = ELX?()] + ELY?(0] 


which indicates that the mean square of Z(t) is equal to the sum of the mean squares of X(t) and Y(t). 


WHITE NOISE 


6.20. Using the notion of generalized derivative, show that the generalized derivative X'(t) of the 
Wiener process X(t) is a white noise. 


From Eq. (5.64), 
Ry(t, 5) = о? min(t, s) 


and from Eq. (6.119) (Prob. 6.9), we have 


, 
£ ңа, s) = o?u(t — s) (6.135) 
Os 


Now, using the 6 function, the generalized derivative of a unit step function u(r) is given by 
d 
— u(t) = ó(t 
ji ) = бї) 


Applying the above relation to Eq. (6.135), we obtain 


2 


д 
— =g? — — s} = g?ôlt — 
125 Күй, 5) = с at u(t — s) = o^ó(t — s) (6.136) 


which is, by Eq. (6.116) (Prob. 6.7), the autocorrelation function of the generalized derivative X'(t) of the 
Wiener process X(t); that is, 
RyAt, 5) = о?б( — 5) = о?ё(т) (6.137) 


where т = t — s. Thus, by definition (6.43), we see that the generalized derivative X‘(t) of the Wiener process 
X(t) is a white noise. 

Recall that the Wiener process is a normal process and its derivative is also normal (see Prob. 6.10). 
Hence, the generalized derivative X'(t) of the Wiener process is called white normal (or white gaussian) noise. 


6.21. Let X(t) be a Poisson process with rate 4. Let 
Y(t) = X(t) — А 
Show that the generalized derivative Y'(t) of Y(t) is a white noise. 
Since Y(t) = X(r) — At, we have formally 
Y'() = X(t) — А (6.138) 
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Then E[Y'(0] = EL[X'(0 — 4] = EEX D] А (6.139) 


Күй, 5) = ELY'(OY'()] = ЕГ) ~ ААХ) — А) 
= E[X'(0X'(s) — AX'(s) — АХ) + 4?] 
= E[X'(0X'(] — AELX(s)] — AELX'(0] + 2? (6.140) 
Now, from Eqs. (5.56) and (5.60), we have 


EL X(t)] = åt 
Ry(t, s) = A min(t, s) + А215 
Thus E[X'()] = A and E[X'(3] =A (6.141) 
and from Eqs. (6.7) and (6.137), 
E[X'(0X'(s)] = Кү, s) = PN = Ad(t — s) + 1? (6.142) 
Substituting Eq. (6.141) into Eq. (6.139), we obtain 
E[Y't] =0 (6.143) 
Substituting Eqs. (6.141) and (6.142) into Eq. (6.140), we get 
Ry(t, 5) = Ad(t — 5) (6.144) 


Hence we see that Y'(t) is a zero-mean WSS random process, and by definition (6.43), Y(t) is a white noise 
with т? = 4. The process Y'(t) is known as the Poisson white noise. 


Let X(t) be a white normal noise. Let 
Үш) = [xe da 
0 
(a) Find the autocorrelation function of Y(t). 
(b Show that Y(t) is the Wiener process. 
(а) From Eq. (6.137) of Prob. 6.20, 
R,(t, 5) = a?2ó(t — s) 


Thus, by Eq. (6.11), the autocorrelation function of Y(t) is 


Ry(t, 5) = [ [е B) dB da 
0 J0 


E f [= — В) da dB 
o Jo 


=o? [ч - pdf 
0 
mint, 5) 

= а? | dp = а? min(t, s) (6.145) 
0 


(b) Comparing Eq. (6.145) and Eq. (5.64), we see that Y(t) has the same autocorrelation function as the 
Wiener process. In addition, Y(t) is normal, since X(t) is a normal process and Y(0) = 0. Thus, we conclude 
that Y(t) is the Wiener process. 


Let Y(n) = X(n) + W(n) where X(n) = A (for all n) and A is a r.v. with zero mean and variance 
c2, and W(n) is a discrete-time white noise with average power 02. It is also assumed that X(n) 
and W(n) are independent. 


(a) Show that Y(n) 15 WSS. 
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(b) Find the power spectral density SQ) of Y (n). 


(a) The mean of Y (n) is 
E[Y(] = E[X()] + E[W(] = E(4) + EL[W(n)] = 


The autocorrelation function of Y(n) is 


Ry(n, n + k) = E([X(n) + W(n)]EX(n + К) + Win + KY} 
= E n)X(n + k)] + ELX(n JIEEW (n + А] + E[W(n)] ELX(n + k)] + ЕГИ (п) (п + k)] 


(42) + Ry(k) = са? + o?ó(k) = Ry(k) (6.146) 


Thus Y(n) is WSS. 
(b) Taking the Fourier transform of Eq. (6.146), we obtain 


SA) = 2no ,75(Q) + о? -л<9 <л (6.147) 


RESPONSE OF LINEAR SYSTEMS TO RANDOM INPUTS 


6.24. Derive Eq. (6.58). 
Using Eq. (6.56), we have 
Ry(t, 5) = EL Y(t) ¥(s)] 


= AL h(a)X(t — a) da F h(B)X(s — p) a| 


^ Nh B)ELX(t — a)X(s — B)] da df 


Eu. )R(B)R y(t — a, s — В) da df 


6.25. Derive Eq. (6.63). 
From Eq. (6.62), we have 


- F u h(a)h(B)R y(t + æ — В) da df 


Taking the Fourier transform of Ry(t), we obtain 


So) = [м y(0e 7 дт = [` F F h(x)h(B)R y(t + a — В)е до dB dx 
Letting t + « — f = A, we get 
So) = | І | . | а) В)К (Де 790—880 da dB dà 


_ F haje ap. рет jog ap |” Ry(A)e eg J^ 4) 


= H(—o)H(o)S yc) 
= H*(a)H(o)S (о) = | H(o) |* Sxl) 


6.26. А WSS random process X(t) with autocorrelation function 
R,(t) = e7"! 
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where a is a real positive constant, is applied to the input of an LTI system with impulse 
response 


h(t) = e^ u(t) 


where b is a real positive constant. Find the autocorrelation function of the output Y(t) of the 
system. 


The frequency response Н(о) of the system is 


1 
Н(о) = F [h(t)] Ыр; 


The power spectral density of X(t) is 
2a 
Sx(o) = £[Ry(9)] = org 


By Eq. (6.63), the power spectral density of Y(t) is 
1 2а 
=|H(w)|? = (| 
Syw) = | Н(о) |S) (als) 


_ а 2Ь b 2a 
C (a? — Ь2)Ь \w? +b?) (a? - b*)b No? + а? 


Taking the inverse Fourier transform of both sides of the above equation, we obtain 


1 
(a? — b*)b 





Кт) = (ae ^ Plt! — pet) 


6.27. "Verify Eq. (6.25), that is, the power spectral density of any WSS process X(t) is real and S,(@) > 0. 


The realness of Sy(o) was shown in Prob. 6.16. Consider an ideal bandpass filter with frequency 
response (Fig. 6-5) 
1 w «|o| < 0, 
H(o) = ! 
(о) К otherwise 
with a random process X(t) as its input. 
From Eq. (6.63), it follows that the power spectral density 5;(о) of the output Y(t) equals 


S,(@) о; «|o| «o; 


Sy(@) = i 


0 otherwise 


Hence, from Eq. (6.27), we have 


© @2 


1 
Sw) do = 2 — | S,(@) do > 0 
2л 


© 


1 
ELY*()] = > | 


-æ 


which indicates that the area of $,(@) in any interval of о is nonnegative. This is possible only if S,(w) > 0 
for every o. 


Hw) 
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6.28. 


Verify Eq. (6.72). 
From Eq. (6.71), we have 


КЮ) = Y y МОРОК, AK + 1—1) 


i2—ocl-2-ouo 


Taking the Fourier transform of Ry(k), we obtain 


а; © a 


SIM = Y кй, == Y Y Y ADMD (Kk + i— De 


k=-a k=- i=- l=- 


Letting k + i — l = n, we get 


Sy(Q) 


m--wi--wis-m 


H(—0)H(90)8 (0) 


= H*(Q)H(0)5,(Q) = | HQ)? SAN) 
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У Y Y AAR y (me i-i 


X мде У he"! У Ryne 


i-o {= = о n=- 
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6.29. The discrete-time system shown in Fig. 6-6 consists of one unit delay element and one scalar 
multiplier (a < 1). The input X(n) is discrete-time white noise with average power c?. Find the 


spectral density and average power of the output Y(n). 


X(n) 


From Fig. 6-6, Y(n) and X(n} are related by 
Y(n) = aY(n — 1) + X(n) 
The impulse response h(n) of the system is defined by 
h(n) = ah(n — 1) + ó(n) 
Solving Eq. (6.149), we obtain 
h(n} = a"u(n) 


where u(n) is the unit step sequence defined by 


un) = 1 n>0 
5710 n<0 


Taking the Fourier transform of Eq. (6.150), we obtain 


1 


= по Жл 
H(Q) У а"е i-a 


a=0 


- ja 





a«1,|Q| «n 


(6.148) 


(6.149) 


(6.150) 
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6.30. 


6.31. 
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Now, by Eq. (6.48), 
540) = а? 12| «m 
and by Eq. (6.72), the power spectral density of Y(n) is 
SAD) = | H(Q))?S,(Q) = Н(О)Н( —О)$у(О) 


в? 


~ (1 — ае PXI = ae?) 
а? 


ржа? —2a cos © 








[9] «n 


Taking the inverse Fourier transform of Eq. (6.151), we obtain 


в? 
1—4? 





Ry(k) = alk! 


Thus, by Eq. (6.33), the average power of Y(n) is 





ЕГҮЗ0)] = Ry(0) = — 
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(6.151) 


Let Y(t) be the output of an LTI system with impulse response A(t), when X(t) is applied as input. 


Show that 
(a) Rat, з) = [ ABR, з — В dp 
(b) Ry(t, s) = [` h(x)Ryy(t — a, s) da 


(a) Using Eq. (6.56), we have 


Ryy(t, 5) = ELX(OY()] = Af xin | h(B)X(s — В) ap] 


- Г MBIELX(X(s — B] 48 = | С щЮкуй,з— f) dp 


(b) Similarly, 


Ry(t, s) = ELY()¥(s)] = rf | ° h(a) X(t — a) da ro] 


= E h(a)E[X(t — a)Y(s)] da = [` h(a)Ryy(t — a, s) da 


-œ 


(6.152) 


(6.153) 


Let Y(t) be the output of an LTI system with impulse response h(t) when a WSS random process 


X(t) is applied as input. Show that 


(а) Syy(@) = Н(о)5 (о) 
(b) 50) = H*(@)Syy(w) 
(a) If X(t) is WSS, then Eq. (6.152) of Prob. 6.30 becomes 


Ryy(t 5) = | h(B)Rx(s — t — f) ав 


which indicates that R xy(t, s) is a function of the time difference т = s — t only. Hence 


Кум) = | “АВК е В) df 


(6.154) 
(6.155) 


(6.156) 


(6.157) 
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Taking the Fourier transform of Eq. (6.157), we obtain 


$хүу(ю) = { Ry fale! dc = [ r h(B)Ry(x — Bye ^?" dB ат 


= [ [ h(B)Ry(4)e 2A * dB da 


- [ h(B)e" 4°? ар [ Ry(A)e і da = Н(о),(о) 


(b) Similarly, if X(t) is WSS, then by Eq. (6.156), Eq. (6.153) becomes 
Ry(t, 5) = [Г h(a)Ryy(s — t + a) da 
which indicates that R (t, s) is a function of the time difference т = s — t only. Hence 
Ry) = { * Ha)Ryyl(t + а) da 


Taking the Fourier transform Кү(т), we obtain 


5,(0) = [Г Кү()е ^" dt = [Г [ h(a)R у (т + а)е ^" da ат 


= | | h(a)Ryy(4)e TA- da dd 


= r haje da [ Ryy(A)e + da 


— -%0 


= H(—0)S (о) = H*(o)Syy(@) 
Note that from Eqs. (6.154) and (6.155), we obtain Eq. (6.63); that is, 
Sy(w) = Н*(0)5 (о) = H*(w)H(w)S,(@) = | Hw) | Sx) 
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(6.158) 


6.32. Consider a WSS process X(t) with autocorrelation function R,(t) and power spectral density 


Sdo). Let X'(t) = dX(t)/dt. Show that 


d 
(а) Куут) = d Ry(t) 


d? 
() Reli) = — 25 Ri) 


(c) Sy{w) = w7S,(w) 


(6.159) 


(6.160) 


(6.161) 


(a) If X(t) is the input to a differentiator, then its output is Y(t) = X(t). The frequency response of a 


differentiator is known as H(w) = jw. Then from Eq. (6.154), 
Syy (0) = H(o)Sy(o) = јо (о) 


Taking the inverse Fourier transform of both sides, we obtain 


d 
Rxy (0) = a Ry(x) 


(b) From Eq. (6.155), 
Sylw) = H*(@)Syx(@) = —-joSyy 40) 
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Again taking the inverse Fourier transform of both sides and using the result of part (a), we have 


К = 4 R j= d? 
x(t) = 7 di xx (0 = de 


Ry{t) 
(c) From Eq. (6.63), 
Sx (w) = | Н(о) 25 (о) = | jo S ,(0) = c? Sy(o) 


Note that Eqs. (6.759) and (6.160) were proved in Prob. 6.8 by a different method. 


FOURIER SERIES AND KARHUNEN-LOÉVE EXPANSIONS 
6.33. Verify Eqs. (6.80) and (6.81). 
From Eq. (6.78), 


1 ft . 
X, = = { X(t)e “о dt оо = 2n/T 
T Jo 


Since X(t) 15 WSS, E( X(0)] = uy, and we have 


r 
E(X,) = { E[X(t)]e т dt 
о 


т 
= | e^ P" di = uy (n) 


Again using Eq. (6.78), we have 


ү 


1 [7 | 
ЕХ, X3) = d». { X*(s)eimmos as 
T Jo 


1 (7 ‚ 
=— | ELX , X*(s)]e/""s ds 
T Jo 


1 


Now E{X, X*(s)] = 5 | X (pe! dt x] 
D 
|. (* 
== { E(X(0X*(s)]e 7"®® di 
T Jo 
| ft А 
= — { Куй — sje" dt 
0 


T 


Letting  — s = т, and using Eq. (6.76), we obtain 
1 (7 | 
EX, X*t)] = т { Ryltje MOD de 


о 


1 (7 А ; А 
= f | Буте ior dene = Cy e was (6.162) 
o 


| [7 А А 
Thus E(X, xà — T { Ce Inwospimaos ds 
О 


1 fT, 
| e Jn тов ds = c ó(n — m) 


= 065 
о 


Т 


6.34. Let X(t) be the Fourier series representation of X(t) shown in Eq. (6.77). Verify Eq. (6.79). 


CHAP. 6] ANALYSIS AND PROCESSING OF RANDOM PROCESSES 237 


From Eq. (6.77), we have 


} 
= e| xo — y X, e | xy _ y xterm | 


= Е[|Х(01— Y E[XtX(0]e-^"" 


п= - а 


У E[X, X *(tr)]e^"v + y У E[X, X5 ]e/" meo 


п= - 0 n--om--wuw 


Now, by Eqs. (6.81) and (6.162), we have 





ціхо — Xor) = e[ xo- Ў X, eio! 


E[X*X()] = cte 
E[X, X*(1)] = c, e == 
E(X, X7) = Cy O(n — m) 


Using these results, finally we obtain 


ELIXA- 00) = R40— Y et- Y o4 Y «20 


п= – 0 ncm п= – =. 


since each sum above equals R,(0) [see Eq. (6.75)]. 


6.35. Let X(t) be m.s. periodic and represented by the Fourier series [Eq. (6.77) ] 


X(t) = y Xe o = 2n/T, 


п= – о 


Show that 
EC} X(t) |? - у BUX, 2) (6.163) 


From Eq. (6.81), we have 
E(|X,|?) = E(X, XP) = с, (6.164) 
Setting т = 0 in Eq. (6.75), we obtain 


EL|X(?]=R(0)= У c7 X EUIX,P) 


n--a n--ao 





Equation (6.163) is known as Parseval's theorem for the Fourier series. 


6.36. If a random process X(t) is represented by a Karhunen-Loéve expansion [Eq. (6.82)] 
X(t) = PEL O<t<T 
and X,'s are orthogonal, show that ¢,(t) must satisfy integral equation (6.86); that is, 
[Rut sjo (s) ds = A, olt) Oct«T 
Consider 


X(x; = X X, Xa P(t) 
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6.38. 
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Then E[X()X7] = У ЕХ, X7]6,(0 = ECX, PGA 
т= 1 
since X ,'s are orthogonal; that is, E(X,, X*) = 0 if m 4 n. But by Eq. (6.84), 


E[X()X7] = e| xw [хее i| 
0 


T 
- | ELX()X *(s)1¢,(s) ds 
о 
= f Rae 5)ф, (5) ds 
О 
Thus, equating Eqs. (6.165) and (6.166), we obtain 


T 
| Ry(t, S$ (s) ds = E X PAD = A, bolt) 
o 


where 4, = E(| X,I?). 


Let X(t) be the Karhunen-Loéve expansion of X(t) shown in Eq. (6.82). Verify Eq. (6.88). 


From Eqs. (6.166) and (6.86), we have 


T 
E[X(Xz] = | Ry(t. 5$)ф„(5) ds = А, $, (t) 
0 


Now by Eqs. (6.83), (6.84), and (6.167) we obtain 
T 


T 
E(X, X7) = ef X(pr(t) dt x3] -| E[X(0X7 lore) dt 
о 


0 
т т 

= | Am Om) dt = Am | ф„(0ф@) dt 
o o 

= A,,d(m — n) = A, d(n — m) 


Let X(t) be the Karhunen-Loéve expansion of X(t) shown in Eq. (6.82). Verify Eq. (6.85). 


From Eq. (6.82), we have 


E[1X() — X()|?] = e| xo - Э X, d.t) 





| 


- ef) x -F х.) [ino =) xseno | 
n=l п= 1 


= ELI XP] — Y. EDxOXTIéz( 


- Y ELX*OX,14,00 + У У EGG ADS, (OKO 


Using Eqs. (6.167) and (6.168), we have 


ELLX() – £01] = Куй, 0— 24.640620 — У A264002(0 + У А, Ф900) 


=0 
since by Mercer’s theorem [Eq. (6.87)] 


Rt, = E A, A000) 
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(6.165) 


(6.166) 


(6.167) 


(6.168) 
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6.39, 


апа A, = E(|X,|?) = 2? 


Find the Karhunen-Loéve expansion of the Wiener process X(t). 


From Eq. (5.64), 
o*s s<t 


R,(t, 5) = o? min(t, s) = 
alt, s) (8) oc  s»t 


Substituting the above expression into Eq. (6.86), we obtain 


T 
в? | min(t, s)d,(s) ds = A, d. (t) 0<1<T (6.169) 
o 


t T 
or в? | эф (5) ds + с? | ф,(5) ds = А„ф„() (6.170) 
0 t 
Differentiating Eq. (6.170) with respect to t, we get 
T 
о? | Pals) ds = A, (t) (6.171) 
Differentiating Eq. (6.171) with respect to t again, we obtain 
в? 
60) + = $,(0) = 0 (6.172) 


A general solution of Eq. (6.172) is 
$t) = a, sin @,t + b, cos w,t oO, = UNEN 


In order to determine the values of a,, b,, and A, (or w,), we need appropriate boundary conditions. From 
Eq. (6.170), we see that $,(0) = 0. This implies that b, = 0. From Eq. (6.171), we see that ФТ) = 0. This 
implies that 





с (2п – Dx. (n—4)n 





= = — = = 1, 2,... 
Oni, T T п 
Therefore the eigenvalues are given by 
oT? 
= =1,2,... 6.173 
"ера" (6173 
The normalization requirement [Eq. (6.83)] implies that 
T aT 2 
i t}? dt = 7— = 1 = f= 
f (a, sin c, t) 5 эа, T 


Thus, the eigenfunctions are given by 


Ф) = (А si - 2) T t O<t<T (6.174) 


and the Karhunen-Loéve expansion of the Wiener process X(t) is 


22 1\ я 
= dE i “ja 6.175 
X(t) f ix sn AER Oct«T ( ) 


X, = Ё [xo sif — 5) T t 


and they are uncorrelated with variance 4,. 


where X, are given by 
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6.40. Find the Karhunen-Loéve expansion of the white normal (or white gaussian) noise W(t). 
From Eq. (6.43), 
Ry(t, s) = a?ó(t — s) 


Substituting the above expression into Eq. (6.86), we obtain 
T 
а? | ó(t — 5)ф,(5) ds = A, o.) Oct«T 
0 


or [by Eq. (6.44)] 
a?^$.(t) = A, Palt) (6.176) 


and ó,(f) are arbitrary. Thus, any complete orthogonal set {ф„()} with 
? can be used in the Karhunen-Loéve expansion of the white gaussian 


which indicates that all 4, = о? 


corresponding eigenvalues 4, = с 
noise. 


FOURIER TRANSFORM OF RANDOM PROCESSES 
6.41. Derive Eq. (6.94). 
From Eq. (6.59), 


X(@,) = F Х(ђет іе" di Xt.) - f. X(sje оз» ds 
Then Raw, о) = E(X(0,)X*(o,)] = { f Г X()X (se - 7029 dt as 


-f |  E[X()X *s)]e 7929 de ds 
7 | | Куй, s)e Hottest dr ds = Ёо}, —@,) 


in view of Eq. (6.93). 


6.42. Derive Eqs. (6.98) and (6.99). 


Since X(t) is WSS, by Eq. (6.93), and letting — s = т, we have 


a o 
Rw, 0) = | | Ку — s)e Hots) dt ds 


% 
= | Ry(x)e 2°" dt | е9 + оз qs 
-a 


= (01) | e Jm ten ds 
From the Fourier transform pair (Appendix B) 1 > 27é(w), we have 


| егі dt = 2nó(o) 
Hence Ёо), ©.) = 2л$ lo Alo + оз) 
Next, from Eq. (6.94) and the above result, we obtain 


Rg(o,. w) = Rye, — 0) = 2n$,(0,)ó(o, — c) 
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6.43. 


6.44. 


Let X(w) be the Fourier transform of a random process X(t). If X(c) is a white noise with zero 
mean and autocorrelation function q(w,)é(w, — œz), then show that X(t) is WSS with power 
spectral density q(w)/27. 


By Eq. (6.91), 
1 [* . А 
X(t) 2 — | Xwe da 
2n j- 


га 


Then ELX(0] = = | . EL Žo do = 0 (6.177) 


-w 


Assuming that X(t) is a complex random process, we have 


Ry(t, s) = ELX(OX*(s)] = [= p E Žo X ое deo, do, | 
= 1 = * Е: Ў(о ОЎ *( Jior- ons) d d 
^a d [X( (o,)]e w, dw, 


1 (5 [* | 
Ta | | qow, — c,)e?" dw, do, 


po 
1 © А 
= — | qa, je" do, (6.178) 


which depends only on t — s = т. Hence, we conclude that X(t) is WSS. Setting t — s = т and о, = о in Eq. 
(6.178), we have 


| (* 1 А 
= —— дәт d = — jer d 
Ry(*) vt о)е w n "dz q(a a а 


- 
in view of Eq. (6.24). Thus, we obtain Sela) = q(w)/2n. 
Verify Eq. (6.104). 
By Eq. (6.100), 
X(Q)- Y X(me ?" X*(Q) = Y X*(mpe? 
Then Rg(Q,, Q,) = ELX(Q,)X*(Q,)] = Y Y E(X(n)X *(m)]e - £0 7 80m 


a--om--u 


= У У К, (п, m)e Jot Cr aam] = RAQ, -Qj) 


in view of Eq. (6.103). 


6.45. Derive Eqs. (6.105) and (6.106). 


If X(n) is WSS, then Ry(n, т) = Ry(n — m). By Eq. (6.103), and letting n — m = k, we have 
R4Q,Q)- Y У Rdn- me ntm 


= Y Ry(k)e 7 79 y g fh + Qa)m 


к= –- 0 т=- а. 


Ж 
О) У eg Kf tenm 


т=—х 
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6.46. 


6.47. 


6.48. 


6.49. 


6.50. 


ANALYSIS AND PROCESSING OF RANDOM PROCESSES 


From the Fourier transform pair (Appendix B) x(n) = 1 < 226(Q), we have 


У етт 00 Z2 226(0, + 0,) 


Непсе ЕХО, Q2) = 2nS,(Q,)5(Q, + Qj) 
Next, from Eq. (6.704) and the above result, we obtain 
Rx(Qy,, О, = RQ, —Q)) = 21$,(0,)0(Q, — Q) 


Supplementary Problems 


Is the Poisson process X(t) m.s. continuous? 
Hint: Use Eq. (5.60) and proceed as in Prob. 6.4. 
Ans. Yes. 


Let X(t) be defined by (Prob. 5.4) 
X(t) = Y cos wt t>0 


where Y is a uniform r.v. over (0, 1) and @ is a constant. 


(а) Is X(t) m.s. continuous? 
(b Does X(t) have a ms. derivative? 


Hint: Use Eq. (5.87) of Prob, 5.12. 
Ans. (a) Yes; (b) yes. 


Let Z(t) be the random telegraph signal of Prob. 6.18. 


(a) Is Z(t) m.s. continuous? 
(b) Does Z(t) have a m.s. derivative? 


Hint: Use Eq. (6.132) of Prob. 6.18. 
Ans. (a) Yes; (b) no. 


Let X(t) be a WSS random process, and let X(t) be its m.s. derivative. Show that E[X(r)X'(t)] = 0. 


Hint: | Use Eqs. (6.13) [or (6.14)] and (6.117). 


2 r* Tj2 
Let Z(t) = т | X(a) da 


where X(t) is given by Prob. 6.47 with о = 27/Т. 


(a) Find the mean of Z(t). 
(b) Find the autocorrelation function of Z(t). 


1, 
Ans. (a) — = sin ot 
п 


4 
(b) Кд, 5) = —34 sin ct sin ws 
3n 
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6.51. 


6.52. 


6.53. 


6.54. 


6.55. 


6.56. 


Consider a WSS random process X(t) with E[X(t)] = uy. Let 


1 Tj2 
ot» - | X(t) dt 


-Ti2 


The process X(t) is said to be ergodic in the mean if 


Lim. (X(t)» ; = Е[Х()] = ux 


Toa 
Find E[4X(t)»4]. 
Ans. Hy 


Let X(t) = A cos(w t + Ө), where A and о, are constants, © is a uniform r.v. over (—, л) (Prob. 5.20). 
Find the power spectral density of X(t). 


А?п 
Ans. S,(@) = EB [d(w — wo) + ó(o + €x] 


A random process Y(t) is defined by 
Y(t) = AX(t) cos(w, + ©) 
where A and œw, are constants, Ө is a uniform r.v. over (— z, л), and X(t) is a zero-mean WSS random 


process with the autocorrelation function R,(t) and the power spectral density S,(w). Furthermore, X(t) 
and © are independent. Show that Y(t) is WSS, and find the power spectral density of Y(t). 


2 
Ans. 80) = E [Selo — о,) + Sy(w + @,)] 


Consider a discrete-time random process defined by 
X(n) = У a; cos(Q;n + ©, 
i=l 
where а; and Q; are real constants and Ө, are independent uniform r.v.’s over (— л, л). 


(a) Find the mean of X(n). 
(b) Find the autocorrelation function of X(n). 


Ans. (a) E[X(m]-0 


(b) Ry(nn-c-k)- 5 


a? cos(Q;k) 
=1 


Consider a discrete-time WSS random process X(n) with the autocorrelation function 
Ry(k) = 10e %51% 
Find the power spectral density of X(n). 


6.32 
Ans. 500) = r3 -120csQ0 SF 


Let X(t) and Y(t) be defined by 
X(t) = U cos ost + V sin wot 
Y(t) = V cos wt — U sin wot 
where «y is constant and U and V are independent r.v.'s both having zero mean and variance о?. 
(a) Find the cross-correlation function of X(t) and Y(t). 
(b) Find the cross power spectral density of X(t) and Y(t). 
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6.57. 


6.58. 


6.59. 


6.60. 


6.61. 


6.62. 


6.63. 
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Ans. (а) Ryy(t, + т) = — о? sin wot 
(Б) Syy(w) — ja?n[0(c — wo) — Sw + wo)] 


Verify Eqs. (6.36) and (6.37). 

Hint: Substitute Eq. (6.18) into Eq. (6.34). 

Let Y(t) = X(t) + W(t), where X(t) and W(t) are orthogonal and W(t) is a white noise specified by Eq. (6.43) 
or (6.45). Find the autocorrelation function of Y (t). 


Ans. Ryt, s) = Ку, s) + 0?ó(t — s) 


A zero-mean WSS random process X(t) is called band-limited white noise if its spectral density is given by 


io |o] < ов 


$ = 
x) = 4 lol > wy 
Find the autocorrelation function of X(t). 


No Wz, Sin (gt 
Ans. Ry(t) = 0—8 —— 
2л Wat 


A WSS random process X(t) is applied to the input of an LTI system with impulse response A(t) = 3e7 ?'u(t). 
Find the mean value of Y(t) of the system if E[ X(t)] = 2. 

Hint: Use Eq. (6.59). 

Ans. 3 

The input X(t) to the RC filter shown in Fig. 6-7 is a white noise specified by Eq. (6.45). Find the mean- 
square value of Y(t). 

Hint: Use Eqs. (6.64) and (6.65). 

Ans. o°/(2RC) 


X(t) € yy 


Fig. 6-7 RC filter. 


The input X(z) to a differentiator is the random telegraph signal of Prob. 6.18. 


(a) Determine the power spectral density of the differentiator output. 
(b) Find the mean-square value of the differentiator output. 
440? 
o + 442 
(b E[Y*()] = оо 


Ans. (а) Sy(a) = 


Suppose that the input to the filter shown in Fig. 6-8 is a white noise specified by Eq. (6.45). Find the power 
spectral density of Y (t). 


Ans. Sylw) = o7(1 + a? + 2a cos wT) 
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6.64, 


6.65. 


6.66. 


6.67. 


6.68. 





Fig. 6-8 


Verify Eq. (6.67). 
Hint: Proceed as in Prob. 6.24. 


Suppose that the input to the discrete-time filter shown in Fig. 6-9 is a discrete-time white noise with 
average power o?. Find the power spectral density of Y(n). 


Ans. Sy(Q) = o?(1 + a? + 2a cos О) 


X(n) Y(n) 





Fig. 6-9 


Using the Karhunen-Loéve expansion of the Wiener process, obtain the Karhunen-Loéve expansion of the 
white normal noise. 


Hint: Take the derivative of Eq. (6.175) of Prob. 6.39. 


2 д 1 
Ans. [e Ene O«r«T 


where W, are independent normal r.v.’s with the same variance o7. 
Let Y(t) = X(t) + W(t), where X(t) and W(t) are orthogonal and W(t) is a white noise specified by Eq. (6.43) 
or (6.45). Let $,(t) be the eigenfunctions of the integral equation (6.86) and 4, the corresponding eigenvalues. 


(a) Show that $,(1) are also the eigenfunctions of the integral equation for the Karhunen-Loéve expansion 
of Y(t) with Ry(t, s). 


(b) Find the corresponding eigenvalues. 
Hint: | Use the result of Prob. 6.58. 
Ans. (b 4, +o? 


Suppose that 
Хи) = У X, ev 


where X, are r.v.’s and oy is a constant. Find the Fourier transform of X(t). 


Ans. X(@) = У 2nX, о — noo) 
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6.69. Let ¥(w) be the Fourier transform of a continuous-time random process X(t). Find the mean of X(c). 


© 


Ans. F[py(t)) = | By(t)e >” dt where y(t) = E[X(0] 


-— 


6.70. Let 
Х(О)= Y Xine 
where Е[Х(п)] = 0 and E(X(n)X(k)] = ¢,? 6(n — k). Find the mean and the autocorrelation function of 
X(Q). 


Ans. E[X(Q] = 0 RQ, Q,— Y, oe /Q-83» 


n=— а) 


Chapter 7 


Estimation Theory 


71 INTRODUCTION 


In this chapter, we present a classical estimation theory. There are two basic types of estimation 
problems. In the first type, we are interested in estimating the parameters of one or more r.v.'s, and in 
the second type, we are interested in estimating the value of an inaccessible r.v. Y in terms of the 
observation of an accessible r.v. X. 


72 PARAMETER ESTIMATION 


Let X be a r.v. with pdf f(x) and Х,,..., X, a set of n independent r.v.'s each with pdf f(x). The 
set of r.v.’s (Xi, ..., X,) is called a random sample (or sample vector) of size n of X. Any real-valued 
function of a random sample s(X,,..., X,) is called a statistic. 

Let X be a r.v. with pdf f(x; 6) which depends on an unknown parameter 0. Let (X,, ..., X,) bea 
random sample of X. In this case, the joint pdf of X,,..., X, is given by 


L(x; 8) = /(х,..., х„; 8) = [| 70; 9 (7.1) 
i=t 
where x,,..., x, are the values of the observed data taken from the random sample. 
An estimator of Ө is any statistic s(X , ..., X,), denoted as 
© = s(X,, ..., Xj) (7.2) 
For a particular set of observations X, = x,,..., X, = x,, the value of the estimator s(x,, ..., x,) will 


be called an estimate of 0 and denoted by 6. Thus an estimator is a r.v. and an estimate is a particular 
realization of it. It is not necessary that an estimate of a parameter be one single value; instead, the 
estimate could be a range of values. Estimates which specify a single value are called point estimates, 
and estimates which specify a range of values are called interval estimates. 


73 PROPERTIES OF POINT ESTIMATORS 
A. Unbiased Estimators: 
An estimator © = s(X,,..., X,) is said to be an unbiased estimator of the parameter 6 if 
E(®) = 0 (7.3) 
for all possible values of 0. If © is an unbiased estimator, then its mean square error is given by 
EKO — 6)°] = Е([Ө — E(©)]*} = Var(O) (7.4) 


That is, its mean square error equals its уапапсе. 


B. Efficient Estimators: 


An estimator ©, is said to be a more efficient estimator of the parameter 0 than the estimator ©, 
if 
1. Ө, and ©, are both unbiased estimators of б. 
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2. Var(0,) < Var(@,). 


The estimator Өџу = s(X,, ..., X,) is said to be a most efficient (or minimum variance) unbiased 
estimator of the parameter @ if 


1. Itis an unbiased estimator of 0. 
2. Var(Oyy) < Var(9) for all Ө. 


C. Consistent Estimators: 


The estimator ©, of 0 based on a random sample of size n is said to be consistent if for any small 
£ 7 0, 


lim P(jO, - 0| « е) = 1 (7.5) 
or equivalently, 
lim РӨ, — @|>e) 20 (7.6) 


The following two conditions are sufficient to define consistency (Prob. 7.5): 


1. lim E(9,) = 0 (7.7) 
2. lim Var(9,) 20 (7.8) 


74 MAXIMUM-LIKELIHOOD ESTIMATION 


Let f(x; 0) = f(x, ..., Xa; 0) denote the joint pmf of the r.v.’s X,,..., X, when they are discrete, 
and let it be their joint pdf when they are continuous. Let 


LO) = f(x; 0) = /(ху,..., x4: Ө) (7.9) 


Now L(0) represents the likelihood that the values x,, ..., x, will be observed when Ó is the true value 
of the parameter. Thus L(0) is often referred to as the likelihood function of the random sample. Let 
Our = (х1, ..., X,) be the maximizing value of L(0); that is, 


L(0,,,) = max LO) (7.10) 
8 


Then the maximum-likelihood estimator of 8 is 
Our = s(X,, -3 X) (7.11) 


and 0,,, is the maximum-likelihood estimate of 8. 

Since L(0) is a product of either pmf's or pdf's, it will always be positive (for the range of possible 
value of 0). Thus In L(0) can always be defined, and in determining the maximizing value of б, it is 
often useful to use the fact that L(0) and In L(0) have their maximum at the same value of 0. Hence, 
we may also obtain 6,,, by maximizing In L(0). 


75 BAYES’ ESTIMATION 


Suppose that the unknown parameter @ is considered to be a r.v. having some fixed distribution 
or prior pdf f(0). Then f(x; 0) is now viewed as a conditional pdf and written as f(x |0), and we can 
express the joint pdf of the random sample (X,, ..., X,) and 0 as 


fx х 8 =S (Xis s Xa OSO) (7.12) 
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and the marginal pdf of the sample is given by 


fes cnp = | ftem sos di (7.13) 
Re 


where К, is the range of the possible value of Ө. The other conditional pdf, 


f(x, es x, 0) f(x, ..., x,18)f(0) 
f(8| x4, ..., x,) = ————— B = А 7.14 
Ini f(x, ..., x) /(х\,...› Xp) (7.14) 
is referred to as the posterior pdf of 0. Thus the prior pdf f(0) represents our information about @ 
prior to the observation of the outcomes of X,, ..., X,, and the posterior pdf f(0|x,, ..., x,) rep- 
resents our information about 0 after having observed the sample. 
The conditional mean of 6, defined by 


05 = E(0xi, ..., Xp) -| 0f(8|x,, ..., x,) 49 (7.15) 
Re 


is called the Bayes' estimate of 0, and 
©, = E(0| X,, ..., X,) (7.16) 


is called the Bayes’ estimator of Ө. 


7.6 MEAN SQUARE ESTIMATION 


In this section, we deal with the second type of estimation problem—that is, estimating the value 
of an inaccessible r.v. Y in terms of the observation of an accessible r.v. X. In general, the estimator Y 
of Y is given by a function of X, g(X). Then Y — Y = Y — g(X) is called the estimation error, and 
there is a cost associated with this error, СГУ — g(X)]. We are interested in finding the function g(X) 
that minimizes this cost. When X and Y are continuous r.v.'s, the mean square (m.s.) error is often 
used as the cost function, 


CEY — 9(Х)] = E{LY — 9001 (7.17) 
It can be shown that the estimator of Y given by (Prob. 7.17), 
Y = g(X) = E(Y| X) (7.18) 


is the best estimator in the sense that the m.s. error defined by Eq. (7.17) is a minimum. 


7/7 LINEAR MEAN SQUARE ESTIMATION 


Now consider the estimator Y of Y given by 


Y-g(X)-aX +b (7.19) 
We would like to find the values of a and b such that the m.s. error defined by 
e = E[((Y — Y] = E([Y — (aX + Ь]?} (7.20) 
is minimum. We maintain that a and b must be such that (Prob. 7.20) 
E{LY — (aX + b)]x} 20 (7.21) 
and a and b are given by 
Oxy бү 


а= —5 = су Pxy b = py — apy (7.22) 
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and the minimum m.s. error e,, is (Prob. 7.22) 
Cm = Oy (1 — pxy?) (7.23) 


where ogy = Cov(X, Y) and руу is the correlation coefficient of X and Y. Note that Eq. (7.21) states 
that the optimum linear m.s. estimator Y = aX + b of Y is such that the estimation error Y — Î = Y — 
(aX + b) is orthogonal to the observation X. This is known as the orthogonality principle. The line 
y = ax + b is often called a regression line. 

Next, we consider the estimator Y of Y with a linear combination of the random sample 
(Xise Xn) by 


Y= $ aX; 


i 
i-1 


(7.24) 


Again, we maintain that in order to produce the linear estimator with the minimum m.s. error, the 
coefficients a; must be such that the following orthogonality conditions are satisfied (Prob. 7.35): 


d(r- Lax) |-o j-doan (7.25) 
i=) 


Solving Eq. (7.25) for a;, we obtain 


a= КЬ (7.26) 
where 
а, b, Ry Ri, 
a-|: b-|: Бу = ЕҮХ) К=| : ^ 4i Ri, = E(X; Xy) 
a, b, К, Run 
and R`! is the inverse of R. 
Solved Problems 


PROPERTIES OF POINT ESTIMATORS 


7.1. Let (X,,..., X,) be a random sample of X having unknown mean д. Show that the estimator of 
u defined by 


-LyX-x (7.27) 


is an unbiased estimator of u. Note that X is known as the sample mean (Prob. 4.64). 
By Eq. (4.108), 


12 g DS 1 
nn = n Хх) 1 нка ии 


Thus, M is an unbiased estimator of д. 


7.2. Let (X, ..., X,) be a random sample of X having unknown mean и and variance o?. Show that 
the estimator of c? defined by 


X (X, — Xy (7.28) 
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7.3. 


7.4. 


where X is the sample mean, is a biased estimator of o°. 
By definition, we have 


а? = E((X; — ny] 


=l— 


Now E(S?) = e| (x, - x] - eli EX — w) — (8 - wr} 
ї= 1 


i Ma iMa 


=i 


ll 
m 
re 


Кх — a? — XX; — ИХ — u) + (X — 2 


= р 


ll 

th 
А 
L. 
ima 


(X; — p? — n(X — ^) 


mI— 


[(X, — 4] — E((X — 2] = e? — og? 


ipa 
isi 


By Eqs. (4.112) and (7.27), we have 





ї=1 Л 
2 21221-1, 
Thus E(S*) = о? – = с? = т с (7.29) 
which shows that $? is a biased estimator of o°. 
Let (X, ..., X,) be a random sample of a Poisson r.v. X with unknown parameter 4. 


(a) Show that 


are both unbiased estimators of 4. 
(b) Which estimator is more efficient? 
(a) By Eqs. (2.42) and (4.108), we have 
12 1 
ЕА) =- Y EX) = (nd) = А 
n c. n 
E(^;) = $EE(X ) + E(X2)] = 2А) = à 
Thus, both estimators are unbiased estimators of A. 
(b) By Eqs. (2.43) and (4.112), 


i 2 ] g 1 A 
аЛ) = = У Мацх) = = È Мац) = = (nd) = - 
і=1 i= 


д 
Var(A^j) = Q3) 7 7 


Thus, if n > 2, A, is a more efficient estimator of å than А», since A/n < 4/2. 
Let (X,, ..., X,) be a random sample of X with mean и and variance o?. A linear estimator of y 


is defined to be a linear function of X,, ..., X,, (X,, ..., Х,). Show that the linear estimator 
defined by [Eq. (7.27)], 
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is the most efficient linear unbiased estimator of р. 


Assume that 
M, =((X,,..., X,) = Уа Х; 
і= 1 


is a linear unbiased estimator of u with lower variance than M. Since М, is unbiased, we must have 


ЕМ) = Ya E(X) = иў 


i=l i=] 


which implies that $7. , a; = 1. By Eq. (4.112), 


Ul 
E 
m 
Е 

ll 
* 


1 
Var(M) = — о? and — Var(M,) = e? Y aj? 
n 
By assumption, 
1 1 
aya? <- а? or Ya? <- (7.30) 
А п п 


Consider the sum 


© 
^ 
TMs 
“ч 
Р 
| 
sis 
М4 
N 
il 


ч 
Е 

m 

| 

кә 
з |S 
+ 
зе 
th 
— 


х 
E 


"s 
| 
2 | — 


M 
lbs IM: "ps 
= 
| 
| 
м 
ES 
+ 
I 


which, by assumption (7.30), is less than 0. This is impossible unless a; = 1/n, implying that M is the most 
efficient linear unbiased estimator of и. 


7.5. Show that if 
lim E(©,) = 0 and lim Var(G,) = 0 
п 0 n= 
then the estimator Ө, is consistent. 
Using Chebyshev's inequality (2.97), we can write 


P6, — 8] >) < © —®#1_1 


Р E Е([Ө, m E(@,) + E(®,) т 87) 


1 
22 


E([O, — Е(Ө,)]? + [E(®,) — 01° + 219, — Е(Ө,)ЈГЕ(Ө,) — 01) 


o 


M 


i (Var(®,) + E{[E(@,) — 0]2) + 2E([O, — E(©,)][E(@,) — 81) 


Thus, if 
lim E(O,) = 8 and lim Var(©,) = 0 
then lim P(|©, — 8| 2 ғ) =0 


that is, O, is consistent [see Eq. (7.6)]. 


7.6. Let (X,, ..., Xn) be a random sample of a uniform r.v. X over (0, a), where a is unknown. Show 
that 


A = max(X,, Xz, ..., X.) (7.31) 


is a consistent estimator of the parameter a. 
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If X is uniformly distributed over (0, a), then from Eqs. (2.44), (2.45), and (4.98) of Prob. 4.30, the pdf of 
Z = max(X,, ..., X,) is 

















n-1 

fala) = и =- (5) 0<z<a (7.32) 

Thus E(A) = fro dz = [= йт=———а 
o a" Jo n+l 

and lim E(A) =a 
Next, E(A2) = [Гена dz = 5 pe dz =- _ 5 а 

_ 2 2. па? п?а? _ п i 

Var(A) = EA) - LEY = n-2 (m+ (n42m-ly^ 

and lim Var(A) = 0 


noo 


Thus, by Eqs. (7.7) and (7.8), A is a consistent estimator of parameter a. 


MAXIMUM-LIKELIHOOD ESTIMATION 


7.7. Let (X,, ..., X,) be a random sample of a binomial r.v. X with parameters (m, p), where т is 
assumed to be known and p unknown. Determine the maximum-likelihood estimator of p. 


The likelihood function is given by [Eq. (2.36)] 
m 


Цр) = f(xy Xi р) = (" Эрч - pnm. ("ч - р)" 


= (7) ee. ("рео u руи" E= 
xy X, 


Taking the natural logarithm of the above expression, we get 


In L(p) = Inc «(Xx) in p+ (m — Lx) In(1 — p) 
i=l i=l 
where c= П (") 





jan Mi 
апа a Lp) =* Èx- (m- Xx) 
Setting d[In L(p)]/dp = 0, the maximum-likelihood estimate р, of p is obtained as 
or Put = — Xs (7.33) 


1 g 1 2 
= — =— 7. 
PuL m L == X (7.34) 
7.8.  Let(X,,..., X,) bea random sample of a Poisson r.v. with unknown parameter 4. Determine the 


maximum-likelihood estimator of 4. 
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7.9. 


7.10. 
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The likelihood function is given by [Eq. (2.40)] 
n e ^À5 eg "^-in 
LA) = fes sw A= T= = гү 
Thus, In L(A) = —nd + Indy x, — Inc 
i=1 
where с= [T (х) 
i=1 
d 1 In L(A) = + Y 
an qn L0) = 7n ià* 
Setting d[In L(4)]/dÀ = 0, the maximum-likelihood estimate 2,,; of 2 is obtained as 
4 12 
Ам 7 7 Ух (7.35) 
П i=) 
Hence, the maximum-likelihood estimator of А is given by 
1 2 v 
Ам. = 06, X) 7 УХ X (7.36) 
i=) 
Let (Xi, ..., X,) be a random sample of an exponential r.v. X with unknown parameter À. 
Determine the maximum-likelihood estimator of A. 
The likelihood function is given by [Eq. (2.48)] 
LA) = f(x, x5 A) = [Ае = Ате 070 
ist 
Thus, In ЦА) = nin A — AY; x; 
i=1 
d n л 
а — А)=-— i 
an di In ДА) i PR 
Setting d[In L(4)]/dÀ = 0, the maximum-likelihood estimate Ay, of A is obtained as 
А п 
Амь 7 == (7.37) 
Ух 
i=l 
Hence, the maximum -likelihood estimator of А is given by 
n 1 
Ам = 5(Х\,..., X) — =ў (7.38) 
У Хх, 
= 1 
Let (X,, ..., X,) be a random sample of a normal random г.у. X with unknown mean и and 


unknown variance 02. Determine the maximum-likelihood estimators of и and o?. 


The likelihood function is given by [Eq. (2.52)] 


А 1 1 
Цр, о) = f(x зз Xy My O) = П „Гло j| - 26? (x; -ar 
To 


7 (5) 1 1 п 2 
= rn o^ exp| — 522 Los ~p) 
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Thus, In Др, о) = — 5 (2л) — n In o — —— 2 х; — uy 
In order to find the values of и and с maximizing the above, we compute 


Ü 
ou IP 0 0 = а? zÈ- 


д n 1 А 
a, n Цр, о) = 5+5 Dew 


Equating these equations to zero, we get 


TMs 


(x; — Ды) = 0 
1 


1 п 
; 2.5 — luu =—— 


бы? i=1 Cmi 


and 





Solving for дм and бу, the maximum -likelihood estimates of p and o? are given, respectively, by 


y X (7.39) 
-1 


У (x; — fines)” (7.40) 


i 2 А 

Мы = - УХ, = (7.41) 
12 _ 

Sur => У(Х, - Xy (7.42) 


BAYES' ESTIMATION 
711. Let(X,, ..., X,) be the random sample of a Bernoulli r.v. X with pmf given by (Eq. (2.32)] 
f(xip-p-p! х= 0), 1 (7.43) 


where p 0 <р < 1, is unknown. Assume that р is a uniform г.у. over (0, 1). Find the Bayes’ 
estimator of p. 


The prior pdf of p is the uniform pdf; that is, 


/ (р) = 1 0О<р<1 
The posterior pdf of p is given by 
f (X15 > Xn P) 
[Xp s, X) = —M————— 
/Ф!хь» tm ES 


Then, by Eq. (7.12), 
fos s xs Р) = Гь «s х, PSP) 
= рїї — р)" 217 = рч рус" 
where m = Y7., x; and by Eq. (7.13), 


1 
ese o [ fen pdp = [ir = pr" à 
D 


Now, from calculus, for integers m and k, we have 


m!k! 


1 
m _ k = m 
[^ (1 — py dp (m  k 4 1)! (7.44) 
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Thus, by Eq. (7.14), the posterior pdf of p is 


Је ea Xps р) (л + 1)! p”(1 = ру" 
Гріх o x) = хох) m!(n — т)! 





and by Eqs. (7.15) and (7.44), 
1 
Elp] х}, teed X,) = | pf(plx,, эз х„) dp 
0 


+1! of! 
= (п ) | p"! _ ру" ар 
0 


т! {п — т)! 
0 (n+l)! (m 0) и — т)! 
~ тіп — m)! (n + 2)! 


] n 
amtt_ (Sx+1) 


n-r2 п+2 








Hence, by Eq. (7.16), the Bayes’ estimator of p is 


„= BX cs X) es (XXe 1) (745) 
i=] 


7.12. Let (X,, ..., X,) be a random sample of an exponential r.v. X with unknown parameter 4. 
Assume that / is itself to be an exponential г.у. with parameter х. Find the Bayes’ estimator of 4. 


The assumed pnor pdf of A is [Eq. (2.48)] 


ae ^^ оа А>0 
А) = ' 
f(a) f otherwise 
Now Гохо х, 1А = [Ае = Are 2E mo ane m 


i=l 


where m = У". , x,. Then, by Eqs. (7.12) and (7.13), 


Јо, x [ Го «V ТАЛА) dà 
o 


= | Ae "ge^ dd 
0 


e ! 
= Ane tma dÀ = n __ 
а f e X (a + my*! 
By Eq. (7.14), the posterior pdf of А is given by 
u ADA att jag—(atm)a 
Лх) = f(x xf). (a+ my" есет" 
/(ху,..., Xp) n! 


Thus, by Eq. (7.15), the Bayes’ estimate of A is 
Ay = Ex, x) | AF | xy, sso x) dÀ 
о 


(а + т)". 


л 
; | Диет tat ma di 
п! o 


(+ т)" (n+ n 
n! (a + my *? 





+1 +1 
= QT (7.46) 
а+т 
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and the Bayes’ estimator of å is 
Apo — tt -= L 
a+n 
а+ Y X, 


7.13. Let (X,,.. 


of u. 
The assumed prior pdf of и is 
] 6212 





Then by Eq. (7.12), 














fixi хь, А) = (х, х, д) / (д) 
1 - (x, — uy. 1 -p2 
С Quy e| -T 2 le И 
п x4 
"m | 
EE erp гыз 


= (2л) +132 





n xj? ] n 2 
ew - L z$) e z +1) (£5) Е (n+ 10) (x 
2 


і= 1 2 
= (2л * 02 


Then, by Eq. (7.14), the posterior pdf of и is given by 
f Xp Xas и) __ 


дих) 
[ бху, s х, H) du 





=Ce — — 
| 2 as im) 
where С = C(x,, ..., x,) is independent of u. However, Eq. (7.48) is just the pdf of a normal r.v. with mean 
1 n 
n+ | (25) 


1 


and variance 
nal 





Ух 





Hence, the conditional distribution of и given x,, 
( 1 ) 


and variance 








Thus, the Bayes’ estimate of и is given by 
1 
Ух 





чс 
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(7.47) 


., X,) be a random sample of a normal r.v. X with unknown mean и and variance 1. 
Assume that џ is itself to be a normal r.v. with mean 0 and variance 1. Find the Bayes’ estimator 


., X, is the normal distribution with mean 


(7.49) 
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7.14, 


7.15. 
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and the Bayes’ estimator of џ is 
Mp=—~ Y X;2—— X (7.50) 
P nl; i nl f 
Let (X,, ..., X,) be a random sample of a r.v. X with pdf f(x; 6), where @ is an unknown 


parameter. The statistics L and U determine a 100(1 — «) percent confidence interval (L, U) for 
the parameter 6 if 


P(L«0«U)21—a O<a<1 (7.51) 


and 1 — « is called the confidence coefficient. Find L and U if X is a normal r.v. with known 
variance с? and mean и is an unknown parameter. 


If X = №; о?), then 





is a standard normal r.v., and hence for a given x we can find a number z,,, from Table A (Appendix A) 
such that 


X-u 
P| —z,« ——«zy;|]-1—&a (7.52) 


cj fn 


For example, if 1 — х = 0.95, then z,; = 29.925 = 1.96, and if 1 — а = 0.9, then z,,. = 20.05 = 1.645. Now, 
recalling that с > 0, we have the following equivalent inequality relationships; 


ATH 
ajn 
—2,2(9/4/т) < X — и < zalan) 
-X- zo (0/ n) «-u«-X- Zulo n) 
and Х + zalol n > и> X — zoo] |n) 


Thus, we have 


—Zy2 € < Zu2 


Р(Х — 2,(9/4/т) < и < X + zo] |n] = 1—« (7.53) 
and so 


L-X-zp(c//n and U=X + zloj n) (7.54) 


Consider a normal r.v. with variance 1.66 and unknown mean џ. Find the 95 percent confidence 
interval for the mean based on a random sample of size 10. 


As shown in Prob. 7.14, for 1 ~ a = 0.95, we have 2, = 29,95, = 1.96 and 
z So] n) = 1.96(./1.66/./10) = 0.8 
Thus, by Eq. (7.54), the 95 percent confidence interval for p is 
(X — 0.8, X + 0.8) 


MEAN SQUARE ESTIMATION 


7.16. 


Find the m.s. estimate of a r.v. Y by a constant c. 


By Eq. (7.17), the m.s. error is 


e = ERY - cy] = | (у = Ffy) dy (7.55) 


-D 


CHAP. 7] ESTIMATION THEORY 259 


Clearly the m.s. error e depends on c, and it is minimum if 


de 2 |" dy =0 
= D ФЛ dy = 


ог с [ Ду) ду = с -| yf) dy 


Thus, we conclude that the m.s. estimate c of Y is given by 


ў=с= | yf(y) dy = E(Y) (7.56) 
7.17. Find the m.s. estimator of a r.v. Y by a function g(X) of the r.v. X. 
By Eq. (7.17), the m.s. error is 


e = E([Y – 9001] = [ [Г [у — $90 f Gs, y) dx dy 
Since f (x, y) = f(y| x) f(x), we can write 


e= F sof [Г Ly — 291 f 015) 2 dx (7.57) 


Since the integrands above are positive, the m.s. error e is minimum if the inner integrand, 
[у = #(х)} f |х) dy (7.58) 


is minimum for every x. Comparing Eq. (7.58) with Eq. (7.55) (Prob. 7.16), we see that they are the same 
form if c is changed to g(x) and f (y) is changed to f(y| x). Thus, by the result of Prob. 7.16 [Eq. (7.56)], we 
conclude that the m.s. estimate of Y is given by 


у = | уло) dy = Е(Ү |х) (7.59) 


— o 
Hence, the m.s. estimator of Y is 


f = g(X) = E(Y | X) (7.60) 


7.48. Find the m.s. error if g(x) = E(Y |x) is the m.s. estimate of Y. 


As we see from Eq. (3.58), the conditional mean E(Y | x) of Y, given that X = x, is a function of x, and 
by Eq. (4.39), 


E[E(Y|X)] = E(Y) (7.61) 


Similarly, the conditional mean E[g(X, Y)| x] of g(X, Y), given that X = x, is a function of x. It defines, 
therefore, the function E[g(X, Y)| X] of the r.v. X. Then 


E(E[g(X, Ү)|Х]} = | À | | * gis УЛУ!) ay re dx 


— о - 


7 | | f gix, Fy LO S(x) dx dy 


= f f g(x, y) f(x, y) dx dy = E[g(X, Ү)] (7.62) 


Note that Eq. (7.62) is the generalization of Eq. (7.61). Next, we note that 
E[g.(X)g( YM x] = Elgi)gAY) x] = g (VELAY) |x] (7.63) 
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Then by Eqs. (7.62) and (7.63), we have 
Е[9,(Х)9:(У) = E{E(g(X)g2(¥)| X]} = Ela (XEY) X ]] (7.64) 
Now, setting g,(X) = g(X) and g,(Y) = Y in Eq. (7.64), and using Eq. (7.18), we obtain 
E[g(X)Y) = Elg(X)E(Y | X)] = Eg? (X)) 
Thus, the m.s. error is given by 


e = E((Y — 9(Х)]?) = E(Y?) - 2E[g(X) Y] + ELg? 0] 
= E(Y?) - E[g'(X)) (7.65) 


7.19. Let Y = X? and X be a uniform r.v. over (— 1, 1). Find the m.s. estimator of Y in terms of X and 
its m.s. error. 


By Eq. (7.18), the m.s. estimate of Y is given by 
gx) = E(Y |x) = EX?|X = x) = х? 
Hence, the m.s. estimator of Y is 
ӯ = Х? (7.66) 


The mss. error is 





е = E(LY — ф(Х)]?} = E{LX? — X?) =0 (7.67) 


LINEAR MEAN SQUARE ESTIMATION 
7.20. Derive the orthogonality principle (7.21) and Eq. (7.22). 
By Eq. (7.20), the m.s. error is 
ela, b) = E((Y — (aX + br 


Clearly, the m.s. error e is a function of a and b, and it is minimum if де/да = 0 and de/db = 0. Now 


z. Е(2[Ү — (aX + b)( - X)) = —2E{[Y – (aX + b))X} 
6 = E{2[Y — (aX b) - 0) = -2E{LY — (aX + Ь)]) 
Setting де/да = 0 and де/дЬ = 0, we obtain 
E([Y — (aX + b]X) =0 (7.68) 
Е[Ү — (aX + 8] 20 (7.69) 


Note that Eq. (7.68) is the orthogonality principle (7.21). 
Rearranging Eqs. (7.68) and (7.69), we get 


E(X?)a + Е(Х\Ь = E(X Y) 
E(X)a + b = E(Y) 
Solving for a and b, we obtain Eq. (7.22); that is, 
_ EXY)- EX)EY) _ oxy _ oy 





TEED оу! ^" 
b = E(Y) — aE(X) = py — apy 
where we have used Eqs. (2.31), (3.51), and (3.53). 


7.21. Show that m.s. error defined by Eq. (7.20) is minimum when Eqs. (7.68) and (7.69) are satisfied. 
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7.22. 


7.23. 


7.24. 


Assume that Y = cX + d, where c and d are arbitrary constants. Then 
e(c, d) = E([Y — (cX + dy) = E([Y — (aX + b) + (a — X + (b — d?) 
= E([Y — (aX + 5))?) + Еа — OX + (b — 4)]2) 
+ Xa — c)E([Y — (aX + Б)]Х)} + Ab — DE{LY — (aX + b)]) 
= e(a, b) + E([(a — сх + (b — d)]?} 
+ Xa — c)E{LY — (aX + b)]X} + 20b — d)E([Y — (aX + 5)]) 


The last two terms on the right-hand side are zero when Eqs. (7.68) and (7.69) are satisfied, and the second 
term on the right-hand side is positive if a # c and b = d. Thus, e(c, d) > e(a, Б) for any c and d. Hence, 
e(a, b) is minimum. 


Derive Eq. (7.23). 
By Eqs. (7.68) and (7.69), we have 
R(EY — (aX + Бах} =0 = E([Y — (aX + bjb} 


Then e, = e(a, b) = E([Y — (aX + b)]?} = E([Y — (aX + LY — (aX + b)]) 
= E([Y — (aX + b)]Y} = E(Y?) — aE(XY) — bE(Y) 


Using Eqs. (2.31), (3.51), and (3.53), and substituting the values of a and b [Eq. (7.22)] in the above expres- 
sion, the minimum m.s. error is 


Em = Oy? + Hy? — a8 yy + Hy Hy) — (Hy — арх)иу 


Oxy" Oxy? 
— p2 =m 2 XY _,2 ХҮ — 2 2 
= оү —d6y,— бү ———у =аү|1———у—у}= оү (1— ру) 
Ox Cy 








which is Eq. (7.23). 


Let Y = X?, and let X be a uniform r.v. over (—1, 1) (see Prob. 7.19). Find the linear m.s. 
estimator of Y in terms of X and its m.s. error. 


The linear m.s. estimator of Y in terms of X is 


¥=aX+b 
where a and Б are given by [Eq. (7.22)] 
a= PE b = py — apy 
Now, by Eqs. (2.46) and (2.44), 
uy = E(X) = 0 


E(X Y) = EXX?) = E(X)) = ; [= dx = 0 
Ву Еа. (3.51), 
Oxy = Cov(XY) = E(XY) — E(X)E(Y) = 0 
Thus, a = 0 and b = E(Y), and the linear m.s. estimator of Y is 
Y-b-E(Y) (7.70) 
and the m.s. error is 


e = Ei[Y — EY] = сү? (7.71) 


Find the minimum m.s. error estimator of Y in terms of X when X and Y are jointly normal 
r.v.'s. 
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7.25. 


7.26. 


7.27. 


7.28. 


7.29. 
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By Eq. (7./8), the minimum m.s. error estimator of Y in terms of X is 
f= E(Y|X) 
Now, when X and Y are jointly normal, by Eq. (3.108) (Prob. 3.51), we have 
c c 
E(Y|x) = pzy — X + hy — рху — x 
9x ox 


Hence, the minimum m.s. error estimator of Y is 


^ с с 
Ӯ = E(Y|X) = руу ~ X + uy — рху — Hx (7.72) 
ox Ox 


Comparing Eq. (7.72) with Eqs. (7.19) and (7.22), we see that for jointly normal r.v.'s the linear m.s. estima- 
tor is the minimum m.s. error estimator. 


Supplementary Problems 


Let (X,, ..., X,) be a random sample of X having unknown mean и and variance o?. Show that the 
estimator of ø? defined by 





S? = Yx,- Ху 


п 11 


where X is the sample mean, is an unbiased estimator of c?. Note that S,? is often called the sample 
variance. 


Hint: Show that $,? = — 





i S?, and use Eq. (7.29). 
n— 


Let (X,, ..., X,) be a random sample of X having known mean и and unknown variance о?. Show that the 
estimator of o? defined by 


1 n 
So? =~ У(Х, — Ш? 
nici 
is an unbiased estimator of о?. 
Hint: Proceed as in Prob. 7.2. 
Let (X,,..., X,) be a random sample of a binomial r.v. X with parameter (m, p), where p is unknown. Show 


that the maximum-likelihood estimator of p given by Eq. (7.34) is unbiased. 
Hint: Use Eq. (2.38). 


Let (X ,, ..., X,) be a random sample of a Bernoulli r.v. X with pmf f(x; p) = p*(1 — p)! *, x = 0, 1, where 
p,0 <p < 1, is unknown. Find the maximum-likelihood estimator of p. 


12 = 
Ans. Рм =- УХ, = Х 
nic, 
The values of a random sample, 2.9, 0.5, 1.7, 4.3, and 3.2, are obtained from a r.v. X that is uniformly 
distributed over the unknown interval (a, b). Find the maximum-likelihood estimates of a and b. 


Ans. йыр = min x; = 0.5, bur = max x, = 4.3 
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7.30. 


7.31. 


7.32. 


7.33. 


7.34. 


7.35. 


In analyzing the flow of traffic through a drive-in bank, the times (in minutes) between arrivals of 10 
customers are recorded as 3.2, 2.1, 5.3, 4.2, 1.2, 2.8, 6.4, 1.5, 1.9, and 3.0. Assuming that the interarrival time 
is an exponential r.v. with parameter A, find the maximum likelihood estimate of 4. 


І 
А = — 
Ans. Ам. = 376 


Let (X,, ..., X,) be a random sample of a normal r.v. X with known mean и and unknown variance с?. 
Find the maximum likelihood estimator of с?2. 


М х 


1 
Ans Syr = = (X; — д)? 
i=l 


Let (Xi, ..., X,) be the random sample of a normal r.v. X with mean и and variance o?, where џи is 
unknown. Assume that u is itself to be a normal r.v. with mean и, and variance с; 2. Find the Bayes’ 
estimate of д. 


А Hy nX 1 п . 1 
Ans. jig = 2291 52+ 9 х= 


Let (X,,..., X,) be the random sample of a normal г.у. X with variance 100 and unknown д. What sample 
size n is required such that the width of 95 percent confidence interval is 5? 


Ma 
Kai 


Ans. n= 62 


Find a constant a such that if Y is estimated by aX, the m.s. error is minimum, and also find the minimum 
т.5. error €p- 


Ans. а = E(XY)/E(X?) e, = E(Y?) — LEXY) PLEX)? 


Derive Eqs. (7.25) and (7.26). 
Hint: Proceed as in Prob. 7.20. 


Chapter 8 


Decision Theory 


8.1 INTRODUCTION 


There are many situations in which we have to make decisions based on observations or data 
that are random variables. The theory behind the solutions for these situations is known as decision 
theory or hypothesis testing. In communication or radar technology, decision theory or hypothesis 
testing is known as (signal) detection theory. In this chapter we present a brief review of the binary 
decision theory and various decision tests. 


8.2 HYPOTHESIS TESTING 
A. Definitions: 


A statistical hypothesis is an assumption about the probability law of r.v.’s. Suppose we observe a 
random sample (X,, ..., X,) ofa r.v. X whose pdf f(x; 0) = f(x,, ..., x,; 0) depends on a parameter б. 
We wish to test the assumption 0 = 6, against the assumption 6 = 0,. The assumption 0 = 0, is 
denoted by Н, and is called the null hypothesis. The assumption 0 = 0, is denoted by Н, and is called 
the alternative hypothesis. 


Hy: 0-60, (Null hypothesis) 
H,: 0-0, (Alternative hypothesis) 


A hypothesis is called simple if all parameters are specified exactly. Otherwise it is called compos- 
ite. Thus, suppose Ho: 0 = 0, and H,:0 # 0o; then Ho is simple and Н, is composite. 


B. Hypothesis Testing and Types of Errors: 


Hypothesis testing is a decision process establishing the validity of a hypothesis. We can think of 
the decision process as dividing the observation space R" (Euclidean n-space) into two regions Ry and 
R,. Let x = (x,, ..., x,) be the observed vector. Then if x є Ro, we will decide on Hy; if x e R,, we 
decide on H,. The region Rọ is known as the acceptance region and the region R, as the rejection (or 
critical) region (since the null hypothesis is rejected). Thus, with the observation vector (or data), one 
of the following four actions can happen: 


H true; accept Ho 
Ho true; reject Ho (or accept Н,) 
H, true; accept H, 


Fen PS 


H, true; reject H, (or accept Ho) 


The first and third actions correspond to correct decisions, and the second and fourth actions corre- 
spond to errors. The errors are classified as 


І. Typelerror: Reject Н, (or accept H,) when Hj is true. 
2. Туре П еггог: Reject H, (or accept Ho) when H, is true. 


Let P, and Py denote, respectively, the probabilities of Type I and Type II errors: 
P, = P(D,| Ho) = P(x € R,; Ho) (8.1) 
Py = P(Do| Hj) = P(x e Ro; Hi) (8.2) 
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where D, (i = 0, 1) denotes the event that the decision is made to accept H;. P, is often denoted by a 
and is known as the level of significance, and Py is denoted by В and (1 — f) is known as the power of 
the test. Note that since а and B represent probabilities of events from the same decision problem, 
they are not independent of each other or of the sample size n. It would be desirable to have a 
decision process such that both « and f will be small. However, in general, a decrease in one type of 
error leads to an increase in the other type for a fixed sample size (Prob. 8.4). The only way to 
simultaneously reduce both type of errors is to increase the sample size (Prob. 8.5). One might also 
attach some relative importance (or cost) to the four possible courses of action and minimize the total 
cost of the decision (see Sec. 8.3D). 
The probabilities of correct decisions (actions 1 and 3) may be expressed as 


P(D,| Ho) = P(x € Ro; Ho) (8.3) 
P(D,|H,) = P(x e R,; Hy) (8.4) 
In radar signal detection, the two hypotheses are 
H,: No target exists 
H,: Target is present 


In this case, the probability of a Type 1 error P, = P(D,| Не) is often referred to as the false-alarm 
probability (denoted by Р,), the probability of a Type H error Py = Р(р,|Н,) as the miss probability 
(denoted by Pu), and P(D,|H,) as the detection probability (denoted by Pp). The cost of failing to 
detect a target cannot be easily determined. In general we set a value of P, which is acceptable and 
seek a decision test that constrains Р to this value while maximizing Pp (or equivalently minimizing 
Py). This test is known as the Neyman-Pearson test (see Sec. 8.3C). 


83 DECISION TESTS 
A. Maximum-Likelihood Test: 


Let x be the observation vector and P(x | Hj), i 2 0.1, denote the probability of observing x given 
that Н, was true. In the maximum-likelihood test, the decision regions Ry and К, are selected as 


Ro = (x: P(x| Ho) > Р(хіН,)} 


8.5 
К, = (x: P(x| Ho) < P(x| Hi)) (8.5) 
Thus, the maximum-likelihood test can be expressed as 
H if P(x| Ho) > Р(х\Н)) 
dx)-4.^ . 8.6 
(9 fa if P(x | Ho) < P(x| Hy) 05 
The above decision test can be rewritten as 
P(x| Hy) % 
8.7) 
P(x| Ho) m 
If we define the likelihood ratio A(x) as 
Р(х|Н,) 
A(x) = = (8.8) 
09 = px Ho) 
then the maximum-likelihood test (8.7) can be expressed as 
Hi 
A(x) 2 1 (8.9) 


Ho 
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which is called the likelihood ratio test, and 1 is called the threshold value of the test. 
Note that the likelihood ratio A(x) is also often expressed as 


ЈН) 


А = кн.) 


(8.10) 


В. MAP Test: 


Let P(H;|x), i= 0, 1, denote the probability that H; was true given a particular value of x. The 
conditional probability Р(Н,| х) is called a posteriori (or posterior) probability, that is, a probability 
that is computed after an observation has been made. The probability P(H;), i = 0, 1, is called a priori 
(or prior) probability. In the maximum a posteriori (MAP) test, the decision regions Rọ and R, are 
selected as 


Ro = {x: Р(Нь|х) > P(A, 


x)} 





8.11 
R, = (x: P(Ho|x) < P(H,|x)} 
Thus, the MAP test is given by 
H if Р(Н,|х) > P(H,|x) 
d(x) = 4 ° 12 
(9) P if P(Ho|x) < РІН, |x) (8.12) 
which can be rewritten as 
РОН, |х) 8 
1 8.13 
P(Ho|x) s; (813) 


Using Bayes’ rule [Eq. (1.42)], Eq. (8.13) reduces to 


Ay 
PI HPH) > у (8.14) 
Р(х | Ho)P(Ho) н, 
Using the likelihood ratio A(x) defined in Eq. (8.8), the MAP test can be expressed in the following 
likelihood ratio test as 





н P(H, 
z = 
A(x) ёт рн.) (8.15) 


where у = P(H,)/P(H,) is the threshold value for the MAP test. Note that when Р(Н,) = P(H,), the 
maximum-likelihood test is also the MAP test. 


C. Neyman-Pearson Test: 


As we mentioned before, it is not possible to simultaneously minimize both «(= Pj) and 8(= Py). 
The Neyman-Pearson test provides a workable solution to this problem in that the test minimizes fl 
for a given level of a. Hence, the Neyman-Pearson test is the test which maximizes the power of the 
test 1 — В for a given level of significance a. In the Neyman-Pearson test, the critical (or rejection) 
region R, is selected such that 1 — $ = 1 — P(D)|H,) = P(D,|H,) is maximum subject to the con- 
straint х = P(D,| Hj) = a. This is a classical problem in optimization: maximizing a function subject 
to a constraint, which can be solved by the use of Lagrange multiplier method. We thus construct the 
objective function 


J = (1 — f) — Àla — ao) (8.16) 


where A > 0 is a Lagrange multiplier. Then the critical region R, is chosen to maximize J. It can be 
shown that the Neyman-Pearson test can be expressed in terms of the likelihood ratio test as 
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(Prob. 8.8) 
Hi 


A(x) 21 = 4 (8.17) 


Но 


where the threshold value y of the test is equal to the Lagrange multiplier 4, which is chosen to satisfy 
the contraint х = a. 


D. Bayes’ Test: 


Let C;, be the cost associated with (D;, Hj), which denotes the event that we accept Н, when Н, is 
true, Then the average cost, which is known as the Bayes’ risk, can be written as 


С = Coo P(Do, Ho) + Cio P(Dy, Ho) + Cor P(Do, H4) + C, P(D,, Hy) (8.18) 
where P(D;, Н) denotes the probability that we accept H; when Н, is true. By Bayes’ rule (1.42), we 
have 

С = Coo P(Do | Ho)P(Ho) + Cio P(D,| Ho) P(Ho) + Co, P(Do| H1)P(H1) + С,,Р(О,|Н,)Р(Н,) (8.19) 
In general, we assume that 
Cio > Coo and Cor > Ci (8.20) 


since it is reasonable to assume that the cost of making an incorrect decision is higher than the cost of 

making a correct decision. The test that minimizes the average cost C is called the Bayes’ test, and it 

can be expressed in terms of the likelihood ratio test as (Prob. 8.10) 
т (Cio — Coo)P(Ho) 


MO) ёл = (ei СРН.) 


Note that when Cio — Соо = Co, — C11, the Bayes’ test (8.21) and the MAP test (8.15) are identical. 


(8.21) 


E. Minimum Probability of Error Test: 
If we set Coo = C,, =O and Co, = Cio = 1 in Eq. (8.18), we have 
С = P(D,, Ho) + P(Do, Hi) = P, (8.22) 


which is just the probability of making an incorrect decision. Thus, in this case, the Bayes’ test yields 
the minimum probability of error, and Eq. (8.21) becomes 





(8.23) 


We see that the minimum probability of error test is the same as the MAP test. 


F. Minimax Test: 


We have seen that the Bayes’ test requires the a priori probabilities Р(Н,) and P(H,). Frequently, 
these probabilities are not known. In such a case, the Bayes’ test cannot be applied, and the following 
minimax (min-max) test may be used. In the minimax test, we use the Bayes’ test which corresponds 
to the least favorable P(H,) (Prob. 8.12). In the minimax test, the critical region Rf is defined by 

max C[P(H;), RT] = min max С[Р(Н,), Ri] < max C[P(H,), R,] (8.24) 


P(Ho) Еј Р(Но) P(Ho) 
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for all К, # Rf. In other words, Rf is the critical region which yields the minimum Bayes’ risk for the 
least favorable P(H,). Assuming that the minimization and maximization operations are interchange- 
able, then we have 


min max C[P(H,), Кү] = max min C[P(H,), Ry] (8.25) 


Ry P(Ho) P(Ho) Ri 


The minimization of C[P(H,), R,] with respect to R, is simply the Bayes’ test, so that 
min C[P(Hj), Ri] = С*[Р(Н,)] (8.26) 
Ri 


where C*[P(H,)] is the minimum Bayes’ risk associated with the a priori probability P(Ho). Thus, Eq. 
(8.25) states that we may find the minimax test by finding the Bayes’ test for the least favorable P(H,), 
that is, the Р(Н,) which maximizes C[P(H,)]. 


Solved Problems 


HYPOTHESIS TESTING 


8.1. Suppose a manufacturer of memory chips observes that the probability of chip failure is p = 0.05. 
A new procedure is introduced to improve the design of chips. To test this new procedure, 200 
chips could be produced using this new procedure and tested. Let r.v. X denote the number of 
these 200 chips that fail. We set the test rule that we would accept the new procedure if X « 5. 
Let 

Ho: p=0.05 (No change hypothesis) 
Ну: p« 0.05 (Improvement hypothesis) 


Find the probability of a Type I error. 


If we assume that these tests using the new procedure are independent and have the same probability 
of failure on each test, then X is a binomial r.v. with parameters (n, p) = (200, p). We make a Type I error if 
X < 5 when in fact p = 0.05. Thus, using Eq. (2.37), we have 


P, = P(D,| Ho) = P(X < 5; р = 005) 
5 [200 
= У ( 054095700" 
кто \ К 
Since n is rather large and p is small, these binomial probabilities can be approximated by Poisson prob- 
abilities with 4 = np = 200(0.05) = 10 (see Prob. 2.40). Thus, using Eq. (2.100), we obtain 
5 10* 
Pix уе !? — = 0.067 
k=0 k! 


Note that Ho is a simple hypothesis but Н, is a composite hypothesis. 


82. Consider again the memory chip manufacturing problem of Prob. 8.1. Now let 


Ho: p=0.05 (No change hypothesis) 
H,: p=0.02 (Improvement hypothesis) 


Again our rule is, we would reject the new procedure if X > 5. Find the probability of a Type II 
error. 


CHAP. 8] 


8.3. 


DECISION THEORY 269 


Now both hypotheses are simple. We make a Type II error if X > 5 when in fact р = 0.02. Hence, by 


Eq. (2.37), 


Ру = P(Dy| H,) = P(X > 5; p = 0.02) 


= X Ie ооз (0.98200 -* 


Again using the Poisson approximation with 4 = np = 200(0.02) = 4, we obtain 


k 


54 
Pil- Ye *— = 0.215 
k=0 k! 


Let (X,,..., X,) be a random sample of a normal r.v. X with mean и and variance 100. Let 


Hy: p= 50 
Ay: p= m (>50) 


and sample size n = 25. As a decision procedure, we use the rule to reject Ho if X > 52, where x is 
the value of the sample mean X defined by Eq. (7.27). 


(a) 
(b) 
(c) 


(a) 


Find the probability of rejecting Hy: и = 50 as a function of u (> 50). 
Find the probability of a Type I error a. 
Find the probability of a Type П error f (1) when и, = 53 and (ii) when и, = 55. 
Since the test calls for the rejection of Hy: и = 50 when x > 52, the probability of rejecting Hy is given 
by 

glu) = P(X > 52; и) (8.27) 
Now, by Eqs. (4.112) and (7.27), we have 


Var(X) = oy? = 


Thus, X is N(u; 4), and using Eq. (2.55), we obtain 


X- 2— 52 — 
a = P( ESSERI =) и> 50 (8.28) 











The function g(u) is known as the power function of the test, and the value of g(x) at и = i, (и), is 
called the power at py. 

Note that the power at и = 50, g(50), is the probability of rejecting Hg: и = 50 when Ho is true—that 
is, a Type I error. Thus, using Table A (Appendix A), we obtain 


52 — 50 
2 





z= P, = g(50) = 1 — of ) = i-em = 01387 


Note that the power at и = jy, g(u,), is the probability of rejecting Ho: и = 50 when и = py. Thus, 
1 — g(u,) is the probability of accepting H when и = p,— that is, the probability of a Type II error f. 


(i) Setting и = и, = 53 in Eq. (8.28) and using Table A (Appendix A), we obtain 





B= Py = 1 — 9(53) = o( j :J = Ф(—}) = 1 — 4(1) = 0.3085 


(ii) Similarly, for p = и, = 55 we obtain 


B= Py =1—9(55) = (2 j =) = Ф(— $) = 1 — 4(3) = 0.0668 





Notice that clearly, the probability of a Type II error depends on the value of 4. 
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8.4. Consider the binary decision problem of Prob. 8.3. We modify the decision rule such that we 
reject Ho if x > c. 


(a) Find the value of c such that the probability of a Type I error « = 0.05. 
(b) Find the probability of a Type II error f when и, = 55 with the modified decision rule. 


(a) Using the result of part (b) in Prob. 8.3, c is selected such that [see Eq. (8.27)] 
a = 0(50) = P(X > с; и = 50 = 0.05 
However, when и = 50, X = N(50; 4), and [see Eq. (8.28)] 


X —50 c—50 — 50 
«so = P( ; >> i $)-1- (25 ) = oos 


From Table A (Appendix A), we have 41.645) = 0.95. Thus 











с — 50 


z = 1.645 and с = 50 + 2(1.645) = 53.29 





(b) The power function g(u) with the modified decision rule is 
_ X- 53.29 — 53.29 — 
g(u) = P(X > 53.29; и) = (E 2 M «) =1- (22-4) 


Setting и = p, = 55 and using Table A (Appendix А), we obtain 


B = Pa = 1 — 9(55) = (22 =>) = q( — 04855) 


= 1 — (0.855) = 0.1963 


Comparing with the results of Prob. 8.3, we notice that with the change of the decision rule, œ is 
reduced from 0.1587 to 0.05, but $ is increased from 0.0668 to 0.1963. 


8.5. Redo Prob. 84 for the case where the sample size n = 100. 


(a) With n = 100, we have 


Уақ) = 0x = – о? = 


1 
п 
As in part (a) of Prob. 8.4, с is selected so that 

« = 9(50) = P(X > c; ин = 50) = 0.05 
Since X = N(50; 1), we have 


X — 50 — 50 
aso) = 0 - >f = [= 50) = 1 ~ 67 50) = 005 








Thus € — 50 = 1.645 and с = 51.645 
(b) The power function is 


glu) = Р(Х > 51.645; и) 
Eri. meni 





; n) = 1 — 0(51.645 — y) 


Setting и = д, = 55 and using Table A (Appendix A), we obtain 
B = Py = 1 — 9(55) = Ф(51.645 — 55) = d( — 3.355) = 0.0004 
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Notice that with sample size n = 100, both a and f have decreased from their respective original values 
of 0.1587 and 0.0668 when n = 25. 


DECISION TESTS 


8.6. In a simple binary communication system, during every T seconds, one of two possible signals 
So(t) and s,(t) is transmitted. Our two hypotheses are 


Hg:  So(t) was transmitted. 
H,: s,(t) was transmitted. 
We assume that 
500) 2-0 and s\(th=1 O<t<T 


The communication channel adds noise n(t), which is a zero-mean normal random process with 
variance 1. Let x(t) represent the received signal: 


x(t) = s(t) + n(t) i=0,1 


We observe the received signal x(t) at some instant during each signaling interval. Suppose that 
we received an observation x = 0.6. 


(a) Using the maximum likelihood test, determine which signal is transmitted. 
(b) Find P, and Py. 
(а) The received signal under each hypothesis can be written as 
Hy: x=n 
H: х=1+п 
Then the pdf of x under each hypothesis is given by 
1 


2л 
1 


л 


—х%{2 





f(x] Ho) = е 


eg &- D72 





Д(х|Н,) = 


The likelihood ratio is then given by 


A(x) = f(x} B) = ptt 1/2) 
f(x| Ho) 
By Eq. (8.9), the maximum likelihood test is 
Hi 
eT 1/2) 2 1 
Ho 


Taking the natural logarithm of the above expression, we get 


Hi Hi 
x-$20 or x24 
Ho Ho 


Since х = 0.6 > 4, we determine that signal s,(t) was transmitted. 


(b) The decision regions are given by 


Ro = {x: x < 4} =(— ©, 3) R, = (x: x > 4} = (5, оо) 
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Then by Eqs. (8.1) and (8.2) and using Table A (Appendix A), we obtain 


] a. 
P, = P(D,IHg) = | f(x| Hg) dx = | e Y? dx = 1 — Ф(1) = 0.3085 
| Ry V 2n Ју 


12 
e 07 dx 


us 








Py = PISO = | f(x| H4) dx = 
Ro 


л 


1 


я 





-71/2 
| 72 dy = &( — $) = 0.3085 


8.7. In the binary communication system of Prob. 8.6, suppose that Р(Н,) = 2 апа P(H,) = 1. 


(a) Using the MAP test, determine which signal is transmitted when x = 0.6. 
(b Find P; and Р. 
(а) Using the result of Prob. 8.6 and Eq. (8.15), the MAP test is given by 

н P(A) 


Taking the natural logarithm of the above expression, we gel 
Hi Hi 
х= 21р 2 or х2 5 +102 = 1.193 
Ho Ho 


Since x = 0.6 < 1.193, we determine that signal so(t) was transmitted. 
(b) The decision regions are given by 
Ro = (x: x < 1.193} = (— оо, 1.193) 
R, = (x: x > 1,193} = (1.193, со) 
Thus, by Eqs. (8./) and (8.2) and using Table A (Appendix A), we obtain 


І 
/2л 


N 





рерин | f(x| Hg) dx = | e 72 dy = | — (1.193) = 0.1164 
Ry 1.193 


1 1.193 
Py = P(Do|A =f f(x| H x=] e G7 I dx 
и olti ko 1 Ja „ 


| 
Jin 


0.193 
| e^??? dy = Ф(0.193) = 0.5765 


8.88. Derive the Neyman-Pearson test, Eq. (8.17). 


From Eq. (8.16), the objective function is 
J = (1 B) — Ma — x9) = P(D, | H,) — ALP(D, | Ho) – ж] (8.29) 


where 4 is an undetermined Lagrange multiplier which is chosen to satisfy the constraint « =a). Now, we 
wish to choose the critical region R, to maximize J. Using Eqs. (8./) and (8.2), we have 


J -| sanoal | fl} He) dx — a 
Ry Ry 


= | [A(x] H4) — Af (x | Ho)] dx + ato (8.30) 
Ri 
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To maximize J by selecting the critical region R,, we select x e R, such that the integrand in Eq. (8.30) is 
positive. Thus R, is given by 


R, = {x: [f(x | H) - Af(x| Hg)] > 0j 
and the Neyman-Pearson test is given by 


1н) АЙ 


А = 
0 = кн 2 


and A is determined such that the constraint 
«=P= рона | f(x | Ho) dx = ж 
Ry 


is satisfied. 


8.9. Consider the binary communication system of Prob. 8.6 and suppose that we require that « = 


Р, = 0.25. 
(a) Using the Neyman-Pearson test, determine which signal is transmitted when x = 0.6. 
(b) Find Py. 


(a) Using the result of Prob. 8.6 and Eq. (8.17), the Neyman-Pearson test is given by 
Hi 
ecc 12) 2 À 
Ho 


Taking the natural logarithm of the above expression, we get 


Hy Hy 
x-$2 Ind or x2$4Ind 
Ho Ho 


The critical region R, is thus 
Ку ={x:x>4+4In А} 
Now ме must determine A such that a = P, = P(D,| Ho) = 025. By Eq. (8.1), we have 
| * 1 
P, = P(D,| Ho) -f f(x| Ho) dx = = | e V? dy = 1 - of; 4 In ) 
Ry у 7T 4 2 


/2+Ina 





Thus 1 — Ф(5 + In A) = 025 or A} + In 2) = 0.75 
From Table A (Appendix A), we find that ®(0.674) = 0.75. Thus 
} + In 4 = 0.674 А = 1.19 


Then the Neyman-Pearson test is 


H; 
x 20.674 


Ho 
Since x = 0.6 < 0.674, we determine that signal so(t) was transmitted. 
(b) By Eq. (8.2), we have 


0.674 
Py= POSU | х\н) а 2 gà dx 
Ro \/ 217 -a 
1 
Уп 





-0.326 
f e^? dy = Ф(—0.326) = 0.3722 


8.10. Derive Eq. (8.21). 


274 


8.11. 


DECISION THEORY [CHAP. 8 


By Eq. (8.19), the Bayes’ risk is 
С = Coo P(Do| Ho)P(Ho) + Cio P(D, | Ho)P(Ho) + Cor P(Do| H))P(H ) + Cy, P(D, | H P(A) 


Now we can express 
PIDIH) | fecit dx i20,1;j20,1 (8.31) 
Ri 
Then C can be expressed as 
С = Coo P(Ho) | f(x| Ho) dx + сънна | f(x| Ho) dx 
0 1 


(8.32) 
+ cu | f(x| Hy) dx + сын) | Д(х|Н,) dx 


Since Ry, o R, = Sand Ry с К, = ф, we can write 


| f(x| Hj) dx = [roi dx — | S(x|H) dx = 1 -| S(x] H) dx 
Ro s Ry Ry 
Then Eq. (8.32) becomes 
€ = Coo P(Ho) + C9 P(H,) + | ШС — Coo)P(Ho) f(x | Ho — [Cor — С,)РН,) (ХіН ))} dx 
КІ 


The only variable in the above expression is the critical region R,. By the assumptions [Eq. (8.20) Cio > 
Coo and Се, > C,,, the two terms inside the brackets in the integral are both positive. Thus, C is mini- 
mized if R, is chosen such that 


(Cor — C,0P(GI f(x] Ho) > (Со — Coo) P(Ho) f(x] Ho) 
for all x e R,. That is, we decide to accept Н, if 

(Co, — C,)P(H (x | Hy) > (Cio — Coo) Pl Ho) f(x | Ho) 
In terms of the likelihood ratio, we obtain 


_ f(x|H) z (Cio — Co9)P(H o) 
f(x| Ho) ну (Cor — Ci)P(H ) 





A(x) 


which is Eq. (8.21). 


Consider a binary decision problem with the following conditional pdf's: 

f(x| Ho) = 3e 

f(x| Hi) = e? 
The Bayes' costs are given by 

Coo = Cy, = 0 Co, = 2 Cio -1 
(а) Determine the Bayes’ test if P(H,) = 2 and the associated Bayes’ risk. 
(b) Repeat (a) with Р(Н,) = 1. 
(а) The likelihood ratio is 
A(x) = SEE = Vs ze (8.33) 
By Eq. (8.21), the Bayes' test is given by 

ma- Hi 


ae“ 2 - or e PI24 
Ho (2 = oy Ho 


CHAP. 8] 


DECISION THEORY 


275 


Taking the natural logarithm of both sides of the last expression yields 
Н, 


|x| S — In(3) = 0.693 
Ho 
Thus, the decision regions are given by 


R, = {x: |x| > 0.693} R, 
0.693 0.693 
Then P, = рн) = | 1e7l* dx zl ех dx = 0.5 
-0.693 o 
-0.693 
Py = оын) = | 


еа | eni] 
-0 0.693 о 
and by Eq. (8.19), the Bayes’ risk is 


693 


{x: |x] < 0.693} 


e ?* dx = 025 
С = P(D,| Ho)P(Ho) + 2P(Do| H,)P(H,) = (0.53) + 2(0.25X4) = 0.5 
(b) The Bayes’ test is 





Н, 
= 4 ог e 1х| 2 4 
нњ 2-0} ^ "m 
Again, taking the natural logarithm of both sides of the last expression yields 


H, 


|x| $ — In(4) = 1.386 
Ho 
Thus, the decision regions are given by 


Ro = {х: |x| > 1.386} 
Then 


R, = {x: |x| < 1.386} 
1.386 
P, = PID, 1H) =2 | 

о 


je ^ dx = 0.75 
Ри = P(Dy|H,) = 2 | e^?* dx = 0.0625 

1.386 
and by Eq. (8.19), the Bayes’ risk is 


С = (0.751) + 2(0.0625\4) = 0.4375 


8.12. Consider the binary decision problem of Prob. 8.11 with the same Bayes’ costs. Determine the 
minimax test. 


From Eq. (8.33), the likelihood ratio is 


no = LED _ 


2e 7 
fix | Ho) 
In terms of P(H,), the Bayes’ test [ Eq. (8.27)] becomes 


H H 

noni АН) онт Po) 
Ho 2 1 — P(Ho) Ho 4 1 — Р(Н,) 

Taking the natural logarithm of both sides of the last expression yields 

Hi 


411 — P(H9)] 
$1 = $ 
|x] 5 п РН) 


Ro = {х: |x] > ô} 


(8.34) 
For Р(Н,) > 0.8, ô becomes negative, and we always decide Н. For P(Ho) < 0.8, the decision regions are 


К, = {x: |x| < ô} 


276 


8.13. 


DECISION THEORY [CHAP. 8 


Then, by setting Coo = C,, = 0, Co, = 2, and С,е = 1 in Eq. (8.19), the minimum Bayes’ risk C* can be 
expressed as a function of P(H,) as 


д -ő a 
C*[P(Ho)] = Р(Н,) f le^ dx + 2[1 — ral | е2* dx + { e7? a| 
-5 А 


в © 
= P(H,) [ e7* dx + 4[1 — P(Ho)] { e^? dx 
o 5 
= P(H,)\(1 — 675 + 2[1 — P(Hyje 2° (8.35) 
From the definition of 6 [Eq. (8.34)], we have 
‚ _4(1— PlHo)] 
P(Ho) 
К P(Ho) - P?(Ho) 
a 28 _ 
Thus e SAPA "5 * ett PHO 
Substituting these values into Eq. (8.35), we obtain 
8Р(Н,) — 9P^(H,) 
8[1 — P(Ho)] 


Now the value of Р(Н,) which maximizes C* can be obtained by setting dC*[ P(H,)]/d P(H,) equal to zero 
and solving for P(H,). The result yields P(H,) = 2. Substituting this value into Eq. (8.34), we obtain the 


following minimax test: 


C*[P(H9)] = 


2 = 0.69 


Suppose that we have n observations X;, i= 1, ..., n, of radar signals, and X; are normal iid 
r.v.’s under each hypothesis. Under Hy, X; have mean ш, and variance о?, while under H,, X; 
have mean д, and variance c?, and д, > цо. Determine the maximum likelihood test. 


By Eq. (2.52) for each Х,, we have 





1 1 , 
foil Hg = ло Е 572 (х; — Ho) ] 


1 


1 
Д(х|Н,) = Vine e - zs (xi— m| 


Since the X; are independent, we have 
n 1 ] n 
f(x] Ho) = П fo] Ho) = ino Е 252 X (x; — „| 
1 n 
ІН) = Пл іН, у-у” эт Ema] 


With и, — Ho > 0, the likelihood ratio is then given by 


H, 1 z 
A(x) = ore = eis РЕ — Ho)X; — піву? — 2 


Hence, the maximum likelihood test is given by 





Hi 


1 
exp 2 [È 2(u, — uo)x; — ту — Ho Ji zl 


Ho 
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Taking the natural logarithm of both sides of the above expression yields 


note ух ze 


Ho 


n(p,? — ро?) 
7 283 —— 


or 
Equation (8.36) indicates that the statistic 


l n 
4X, X)o- Y XX 
і=1 
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(8.36) 


provides enough information about the observations to enable us to make а decision. Thus, it is called the 


sufficient statistic for the maximum likelihood test. 


Consider the same observations X;, i = 1,..., n, of radar signals as in Prob. 8.13, but now, under 
Ho, X; have zero mean and variance go’, while under H,, X; have zero mean and variance с; 2, 


and e,? > со2. Determine the maximum likelihood test. 


In a similar manner as in Prob. 8.13, we obtain 





f(xX| Ho) = Ona," exo( - 552 У x) 


Д(х|Н,) = бла y ew - ig Уг) 





With a,? — a? > 0, the likelihood ratio is 


SIH) (оо ү 
А0 = Fx Ho) -(2) = 


and the maximum likelihood test is 


n 2 2 n Hy 

To 017 — 90 ‘| > 
— | expl | ———- X 21 

(=) ( 2a,70,7 ) L Ho 


Taking the natural logarithm of both sides of the above expression yields 


n К в, 2a970,? 
Y x? 2 n|In| — 5—5 
i=l Ho To 9, — fo 








Note that in this case, 


Is the sufficient statistic for the maximum likelihood test. 


(8.37) 


In the binary communication system of Prob. 8.6, suppose that we have n independent obser- 


vations X; = X(t; i = 1,..., n, where0 < t, <e «t, x T. 


(a) Determine the maximum likelihood test. 
(b) Find P, and Py for n = 5 and n = 10. 


(a) Setting zo = 0 and д, = 1 in Eq. (8.36), the maximum likelihood test 15 


i=l 


ai= 
SNS 
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(b) Let y= 5X, 


iz1 


Then by Eqs. (4.108) and (4.112), and the result of Prob. 5.60, we see that Y is a normal r.v. with zero 
mean and variance 1/n under Hy, and is a normal r.v. with mean 1 and variance 1/n under H,. Thus 


P, = P(D, | Ho) = f soin dy -y Jem dy 


! n —12/2 
= e^? d; = | — @(,/n/2) 
„л Ул? 
Jn 


Py = P(Do| H4) -| fH) dy = | e7020- ду 
Ко я Jua 











- n2 
-zl e 72 dz = Ф /n/2) = 1 — O/n/2) 
л — а 


Note that P, = Pu. Using Table A (Appendix A), we have 
P, = Py = 1 — (1.118) = 0.1318 for n= 5 
Р, = Py = 1 — Ф(1.581) = 0.057 for n = 10 


8.16. In the binary communication system of Prob. 8.6, suppose that s,(t) and s,(t) are arbitrary 
signals and that n observations of the received signal x(t) are made. Let n samples of so(t) and 
s,(t) be represented, respectively, by 


So = [S01 502, +++» Sos]. and $ = [535 31225 Sind” 
where T denotes “transpose of.” Determine the MAP test. 


For each X,, we can write 
1 
I(x; | Ho) = Jin Е 3 (x, - 71 
1 
flH) = Яш Е 3 (x; — s 


Since the noise components are independent, we have 
S&H) = Поин) j=0,1 
і=1 


and the likelihood ratio is given by 


[х\н _ 
SEIH) 





A(x) = 


i 
e 
= 
so) 


ыыы] т E 


= e| $ tsu — soj)x, — 3 (547? — so] 


Thus, by Eq. (8./5), the MAP test is given by 


n н, Р н 


Taking the natural logarithm of both sides of the above expression yields 


Н 
Уби — sod, 2 T] 7 zl +462 = so?) (8.38) 
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Supplementary Problems 


8.17. Let (X,,..., Х„ bea random sample of a Bernoulli r.v. X with pmf 
feGpepü-p'"* х= 0,1 
where it is known that 0 < p < 4. Let 
Hy: p-i 
Hy: p=p (<4) 
and n = 20. As a decision test, we use the rule to reject Ho if У", x, < 6. 
(a) Find the power function g(p) of the test. 
(b) Find the probability of a Type | error a. 
(c) Find the probability of a Type IJ error £ (i) when p, = 4 and (ii) when p, = т. 


6 


Ans. (a) g(p)— p (psa —py?^* O<p<4 


(b) а= 0.0577; (с) (i) Ё = 0.2142, (ii) В = 0.0024 


8.18. Let (X,,..., X,) be a random sample of a normal г.у. X with mean и and variance 36. Let 
Hy: 4250 
H: w=55 


As a decision test, we use the rule to accept Ho if X < 53, where x is the value of the sample mean. 


(a) Find the expression for the critical region R,. 
(b) Find хапа fi for n = 16. 


1 п 
Ans. (а) К, = {(x,,..., Xy); X > 53} where X=— У x; 
Л з=] 
(Б) а = 0.0228, B = 0.0913 
8.9. Let (X,,..., X,) be a random sample of a normal г.у. X with mean y and variance 100. Let 
Hy: р = 50 
Hy: р= 55 


As a decision test, we use the rule that we reject Н, if x > c. Find the value of c and sample size n such that 
a = 0.025 and f = 0.05. 


Ans. c= 52.718, п = 52 


820. Let X be a normal r.v. with zero mean and variance c?. Let 


Hy: а? = | 
H: о? =4 


Determine the maximum likelihood test. 


Hi 
Ans. |x| Z 1.36 
Ho 


821. Consider the binary decision problem of Prob. 8.20. Let Р(Н,) = 4 and P(H,) = 4. Determine the MAP 
test. 


Hi 
Ans. |x| Z 1.923 


Ho 
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8.22. Consider the binary communication system of Prob. 8.6. 


(a) Construct a Neyman-Pearson test for the case where « = 0.1. 
(b) Find f. 


Hi 
Ans. (a) |x| 2 1282; (Ы) B-0611 


Ho 
8.3. Consider the binary decision problem of Prob. 8.11. Determine the Bayes’ test if P(Ho) = 0.25 and the 
Bayes' costs are 


Coo = C, = 0 Со = 1 Сло = 2 
Hi 


Ans. |x| S 1.10 
Ho 


Chapter 9 


Queueing Theory 


9.1 INTRODUCTION 


Queueing theory deals with the study of queues (waiting lines). Queues abound in practical situ- 
ations. The earliest use of queueing theory was in the design of a telephone system. Applications of 
queueing theory are found in fields as seemingly diverse as traffic control, hospital management, and 
time-shared computer system design. In this chapter, we present an elementary queueing theory. 


92 QUEUEING SYSTEMS 
A. Description: 


À simple queueing system is shown in Fig. 9-1. Customers arrive randomly at an average rate of 
А„. Upon arrival, they are served without delay if there are available servers; otherwise, they are 
made to wait in the queue until it is their turn to be served. Once served, they are assumed to leave 
the system. We will be interested in determining such quantities as the average number of customers 
in the system, the average time a customer spends in the system, the average time spent waiting in the 


queue, etc. 
Arrivals Departures 
Queue Service 


Fig. 9-1 A simple queueing system. 


The description of any queueing system requires the specification of three parts: 


The arrival process 
The service mechanism, such as the number of servers and service-time distribution 


The queue discipline (for example, first-come, first-served) 


B. Classification: 


The notation A/B/s/K is used to classify a queueing system, where A specifies the type of arrival 
process, B denotes the service-time distribution, s specifies the number of servers, and K denotes the 
capacity of the system, that is, the maximum number of customers that can be accommodated. If K is 
not specified, it is assumed that the capacity of the system is unlimited. For example, an M/M/2 
queueing system (M stands for Markov) is one with Poisson arrivals (or exponential interarrival time 
distribution), exponential service-time distribution, and 2 servers. An М/С/І queueing system has 
Poisson arrivals, general service-time distribution, and a single server. A special case is the M/D/1 
queueing system, where D stands for constant (deterministic) service time. Examples of queueing 
systems with limited capacity are telephone systems with limited trunks, hospital emergency rooms 
with limited beds, and airline terminals with limited space in which to park aircraft for loading and 
unloading. In each case, customers who arrive when the system is saturated are denied entrance and 
are lost. 


C. Useful Formulas 


Some basic quantities of queueing systems are 
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the average number of customers in the system 

the average number of customers waiting in the queue 

the average number of customers in service 

the average amount of time that a customer spends in the system 

the average amount of time that a customer spends waiting in the queue 
the average amount of time that a customer spends in service 


AANTD 


Many useful relationships between the above and other quantities of interest can be obtained by 
using the following basic cost identity: 

Assume that entering customers are required to pay an entrance fee (according to some rule) to 
the system. Then we have 


Average rate at which the system earns = A, x average amount an entering customer (9.1) 
pays where A, is the average arrival rate of entering customers 


À, = lim ^9 


а 
1200 


and X(t) denotes the number of customer arrivals by time t. 
If we assume that each customer pays $1 per unit time while in the system, Eq. (9.1) yields 
L=i,w (9.2) 
Equation (9.2) is sometimes known as Little’s formula. 


Similarly, if we assume that each customer pays $1 per unit time while in the queue, Eq. (9.1) 
yields 


L, = АИ, (9.3) 
If we assume that each customer pays $1 per unit time while in service, Eq. (9.1) yields 
„= 2,0, (9.4) 


Note that Eqs. (9.2) to (9.4) are valid for almost all queueing systems, regardless of the arrival process, 
the number of servers, or queueing discipline. 


9.3 BIRTH-DEATH PROCESS 


We say that the queueing system is in state S, if there are n customers in the system, including 
those being served. Let M(t) be the Markov process that takes on the value n when the queueing 
system is in state S, with the following assumptions: 


1. If the system is in state S,, it can make transitions only to S,_,; or $,,,, n È 1; that is, either a 
customer completes service and leaves the system or, while the present customer is still being 
serviced, another customer arrives at the system; from Sg, the next state can only be $,. 

2. If the system is in state 5, at time t, the probability of a transition to $,,, in the time interval 
(t, t + At) is a, At. We refer to a, as the arrival parameter (or the birth parameter). 

3. If the system is in state S, at time t, the probability of a transition to $,. , in the time interval 
(t, £ + At) is d, At. We refer to d, as the departure parameter (or the death parameter). 


The process N(t) is sometimes referred to as the birth-death process. 
Let p,(t) be the probability that the queueing system is in state S, at time /; that is, 


Palt) = P(N() = nj (9.5) 
Then we have the following fundamental recursive equations for N(t) (Prob. 9.2): 
р) = (a, + dpa)Pa(t) + а, -1рһ- (0 + а, + 1Pn+ (0 п> 1 (9 6) 


polt) = — (ag + do)polt) + dip,(t) 
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Assume that in the steady state we have 


lim p,(t) = p, (9.7) 


1 00 
and setting po(t) and p,(t) = 0 in Eqs. (9.6), we obtain the following steady-state recursive equation: 
(а, + ФР» = аһ-\рһ-1\ + dn+iPn+i п> 1 (9.8) 
and for the special case with dọ = 0, 
do Po = 4, Py (9.9) 


Equations (9.8) and (9.9) are also known as the steady-state equilibrium equations. The state transition 
diagram for the birth-death process is shown in Fig. 9-2. 


а а, a, | а, 
4, d; d, 4, ,, 
Fig. 9-2 State transition diagram for the birth-death process. 


Solving Eqs. (9.8) and (9.9) in terms of py, we obtain 





a 
Pi = 7 Po 
Ug a 
рз = 1d. Po 
172 (9.10) 
_ дойр `` d. 
п dd, . d, Po 
where ро can be determined from the fact that 
V dg , 40% 
„= [162046 ie) =1 (9.11 
A | ааа JP { 


provided that the summation in parentheses converges to a finite value. 


94 THE M/M/1 QUEUEING SYSTEM 


In the M/M/1 queueing system, the arrival process is the Poisson process with rate А (the mean 
arrival rate) and the service time is exponentially distributed with parameter и (the mean service rate). 
Then the process N(t) describing the state of the M/M/1 queueing system at time t is а birth-death 
process with the following state independent parameters: 


a, = А п> 0 d, = р п > 1 (9.12) 
Then from Eqs. (9.10) and (9.11), we obtain (Prob. 9.3) 


po=1-4=1~p (9.13) 


À n 
р, = ( - 30 =(1 — pp (9.14) 


where p = A/u < 1, which implies that the server, on the average, must process the customers faster 
than their average arrival rate; otherwise the queue length (the number of customers waiting in the 
queue) tends to infinity. The ratio р = A/u is sometimes referred to as the traffic intensity of the 
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system. The traffic intensity of the system is defined as 


mean service time mean arrival rate 
mean interarrival time — mean service rate 





Traffic intensity — 


The average number of customers in the system is given by (Prob. 9.4) 








p À 
= = —— 9.15 
i-p pod (9.15) 
Then, setting 2, = 4 in Eqs. (9.2) to (9.4), we obtain (Prob. 9.5) 
1 1 
= (9.16) 
и-5 wl ~ p) 
4 р 
W, = = (9.17) 
* wu—2) wl — p) 
2 2 
L,-—~—--— (9.18) 








9.5 THE M/M/s QUEUEING SYSTEM 


In the M/M/s queueing system, the arrival process is the Poisson process with rate 4 and each of 
the s servers has an exponential service time with parameter и. In this case, the process N(t) describ- 
ing the state of the M/M/s queueing system at time t is a birth-death process with the following 
parameters: 


nu O<n<s 


a, = А п> 0 4 - | 
su п2 5 


(9.19) 


Note that the departure parameter d, is state dependent. Then, from Eqs. (9.10) and (9.11), we obtain 
(Prob. 9.10) 








s (sp) (spy? 
_ (Sp) (50) _ 2 
Po LX паа (9.20) 
$ n 
er) Po n«s 
- i 2 
Pa p's (9.21) 
3! Po п>$ 


where p = A/(su) < 1. Note that the ratio p = A/(su) is the traffic intensity of the M/M/s queueing 
system. The average number of customers in the system and the average number of customers in the 
queue are given, respectively, by (Prob. 9.12) 


A spy 
Hu * so — py Po (9.22) 
p(spy A 
L, = —— = —— . 
agg 57, (2.23) 
By Eqs. (9.2) and (9.3), the quantities W and W, are given by 
_Ł (9.24 
= 7 .24) 
L 1 
= — = Ww _ — 
W, 1 , (9.25) 
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9.6 THE M/M/1/K QUEUEING SYSTEM 


In the M/M/I/K queueing system, the capacity of the system is limited to К customers. When the 
system reaches its capacity, the effect is to reduce the arrival rate to zero until such time as a cus- 
tomer is served to again make queue space available. Thus, the M/M/1/K queueing system can be 
modeled as a birth-death process with the following parameters: 


à Osn<k d > | (9.26) 
а = = , 
" 0 п> К (cH nz 
Then, from Eqs. (9.10) and (9.11), we obtain (Prob. 9.14) 
|| 1-QGp —l-p 
Рот Gye) q- Pf] 027) 
AY (1 — p)p” 
n - (2) p 0 n=1,...,K (9.28) 


where p = A/y. It is important to note that it is no longer necessary that the traffic intensity р = A/p 
be less than 1. Customers are denied service when the system is in state K. Since the fraction of 
arrivals that actually enter the system is 1 — px, the effective arrival rate is given by 


A, = A(1 — px) (9.29) 


The average number of customers in the system is given by (Prob. 9.15) 


1—(K + Dp + Крк! A 
L = -___ aww = — 9.30 
(I — pl — p**!) p u ( ) 
Then, setting A, = А, in Eqs. (9.2) to (9.4), we obtain 
L L 
W -—-————— (9.31) 
A, АІ — рк) 
1 
W,=W-- (9.32) 
H 
L, = A.W, = A — pj)W, (9.33) 


9.7 THE M/M/s/K QUEUEING SYSTEM 


Similarly, the M/M/s/K queueing system can be modeled as a birth-death process with the fol- 
lowing parameters: 











A O<n<K пи O<n<s 
= = 9. 
n n п> К " t п> 5 06.34) 
Then, from Eqs. (9.10) and (9.11), we obtain (Prob. 9.17) 
sol (spy (spy 1 — pk-sti -1 
= —- — 9.35 
Po [> п! + s! 1—р (9.35) 
x Do n«s 
Pn =) ngs (9.36) 
P Po s<n<kK 


where p = A/(su). Note that the expression for p, is identical in form to that for the M/M/s system, 
Eq. (9.21). They differ only in the ро term. Again, it is not necessary that р = А/(5и) be less than 1. The 
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average number of customers in the queue is given by (Prob. 9.18) 


L= p(psY 
a= Po Ур р) 


The average number of customers in the system is 


{1 —[1 + (1 — PXK — s)]p*~} 


A A 
L=L,+—~=L,+—(l — px) 
H H 


The quantities W and W, are given by 


L 
Ws=—=hL — 
А, A" 
L L 
W = = 4 
7 A, ML — px) 


Solved Problems 


9.1. Deduce the basic cost identity (9.1). 
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(9.37) 


(9.38) 


(9.39) 


(9.40) 


Let T be a fixed large number. The amount of money earned by the system by time T can be com- 
puted by multiplying the average rate at which the system earns by the length of time Т. On the other 
hand, it can also be computed by multiplying the average amount paid by an entering customer by the 
average number of customers entering by time T, which is equal to A,T, where 4, is the average arrival rate 


of entering customers. Thus, we have 


Average rate at which the system earns x T = average amount paid by an entering customer x (4,7) 


Dividing both sides by T (and letting T > oo), we obtain Eq. (9.1). 


9.2.  Derive Eq. (9.6). 


From properties 1 to 3 of the birth-death process N(t), we see that at time t + At, the system can be in 


state S, in three ways: 


1. By being in state $, at time t and no transition occurring in the time interval (t, t + At). This happens 
with probability (1 — a, At(1 — d At) ж 1—(а„+4„) At [by neglecting the second-order effect 


a, d, (At)? ]. 


2. By being in state S,_, at time t and a transition to S, occurring in the time interval (t, t + At). This 


happens with probability a,.. , At. 


3. By being in state $,,, at time t and a transition to $, occurring in the time interval (t, £ + At). This 


happens with probability d,, , At. 
Let bít) = PLN(t) = i] 
Then, using the Markov property of N(t), we obtain 


рї + At) = [1 — (a, + d,) At]p (t) + a,-, At p, (4) + 4,., Аер, 1(0) 


Polt + At) = [1 — (ao + do) At]po(t) + d, At pilt) 
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Rearranging the above relations 
р! + At) — рї) 
At 
Polt + At) — Polt) _ 
At 


= — (a, + d,)p,(t) a, ip, y(t) + das Pao a(t) nzl 


— (ag + do)Polt) + dipti) 


Letting At > 0, we obtain 


pt) = — (8, + dp + а, ip, + 41р) nel 
polt) = — (ao + do)polt) + dipi(t) 


9.3. Derive Eqs. (9.13) and (9.14). 
Setting a, = 4, dọ = 0, and d, = win Eq. (9.10), we get 


A 
Ру = = Po = рро 
и 


Xi w 1 
Ур. = Po У P= po —— = 1 lp} <1 
n=0 n=0 1-р 
from which we obtain 
À 
р=1-р=1- - 
р 


AV ANAY 
Pa = () Po = (1 — р)р" = (: - A5 
[л LJ NA 
9.4. Derive Eq. (9.15). 


Since p, is the steady-state probability that the system contains exactly n customers, using Eq. (9.14), 
the average number of customers in the M/M/1 queueing system is given by 


0 oo 


L= Упр, = У all — р)р" = (1 - 9) пр" (9.41) 


п=0 п= 0 


where р = A/u < 1. Using the algebraic identity 





¥ nx" -yog Ix] « 1 (9.42) 


we obtain 


9,5. Derive Eqs. (9.16) to (9.18). 
Since 4, = A, by Eqs. (9.2) and (9.15), we get 
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9.6. 


9.7. 
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which is Eq. (9.16). Next, by definition, 
W,-W-W, (9.43) 
where W, = 1/џ, that is, the average service time. Thus, 
! 1 4 р 
Саса p щи- Щ1—р) 
which is Eq. (9.17). Finally, by Eq. (9.3), 





4 


42 р? 


L, = АЙ, = = — 
OU Mu-A l-p 





which is Eq. (9.18). 


Let W, denote the amount of time an arbitrary customer spends in the M/M/1 queueing system. 
Find the distribution of W,. 


We have 


2 
P{W, <a} 2 У P{W, x a|n in the system when the customer arrives} 
n=0 


x P{n in the system when the customer arrives} (9.44) 


where n is the number of customers in the system. Now consider the amount of time W, that this customer 
will spend in the system when there are already n customers when he or she arrives. When n = 0, then 
W, = Waa that is, the service time. When n > 1, there will be one customer in service and n — 1 customers 
waiting in line ahead of this customer's arrival. The customer in service might have been in service for some 
time, but because of the memoryless property of the exponential distribution of the service time, it follows 
that (see Prob. 2.57) the arriving customer would have to wait an exponential amount of time with param- 
eter и for this customer to complete service. In addition, the customer also would have to wait an exponen- 
tial amount of time for each of the other n — 1 customers in line. Thus, adding his or her own service time, 
the amount of time W, that the customer will spend in the system when there are already n customers when 
he or she arrives is the sum of n + 1 iid exponential r.v.’s with parameter и. Then by Prob. 4.33, we see that 
this г.у. is a gamma г.у. with parameters (n + 1, p). Thus, by Eq. (2.83), 


. . a 1 n 
P(W, x a|n in the system when customer arrives] — f ре“ ш? dt 
0 п! 


From Eq. (9.14), 


AMAY 
P{n in the system when customer arrives} = p, = (: — 28 
и 


Hence, by Eq. (9.44), 





Ыр" OSG) e] 
Fy, = PIW, sa} = “——{10—-}-) a 
н. Pl o) Xll n! H/NA 
К а а = (At 
= AC Ае > " d 
= [aeo dt dew? (9.45) 
0 


Thus, by Eq. (2.79), W, is an exponential r.v. with parameter и — À. Note that from Eq. (2.99), E(W,) = 
l/(u — А), which agrees with Eq. (9.16), since W = E(W,). 


Customers arrive at a watch repair shop according to a Poisson process at a rate of one per 
every 10 minutes, and the service time is an exponential r.v. with mean 8 minutes. 


(a) Find the average number of customers L, the average time a customer spends in the shop 
W, and the average time a customer spends in waiting for service W,. 


CHAP. 9] QUEUEING THEORY 289 


9.8. 


9.9, 


(b) Suppose that the arrival rate of the customers increases 10 percent. Find the corresponding 
changes in L, W, and W,. 


(a) The watch repair shop service can be modeled as an M/M/1 queueing system with 4 = 45, = 
from Eqs. (9.15), (9.16), and (9.43), we have 


a= 
4 
> 
= 
e 


À 1 
L=—— =- = 4 
u-A $-—39 
W ! ! 40 minut 
= —— = —— = 40 minutes 
и-А %- 15 
W, = W — W, = 40 — 8 = 32 minutes 
(b) NowA= 4, u = 4. Then 
A П 
L=—— = = 8 
H—-À $8—5 
1 1 . 
W =; 217 g4 mmus 


И = W — W = 72 — 8 = 64 minutes 


It can be seen that an increase of 10 percent in the customer arrival rate doubles the average number 
of customers in the system. The average time a customer spends in queue is also doubled. 


A drive-in banking service is modeled as an M/M/1 queueing system with customer arrival rate 
of 2 per minute. It is desired to have fewer than 5 customers line up 99 percent of the time. How 


fast should the service rate be? 


From Eq. (9.14), 
. x ao 5 À 
Р{5 or more customers in the system} = Y p, = У (1 ~ р)р" = р p=- 
n=5 a-5 и 
In order to have fewer than 5 customers line ир 99 percent of the time, we require that this probability be 


less than 0.01. Thus, 


from which we obtain 


As 2° 
5> —— =— = 320 024 
£ 0.01 001 3200 ar и25 


Thus, to meet the requirements, the average service rate must be at least 5.024 customers per minute. 


People arrive at a telephone booth according to a Poisson process at an average rate of 12 per 
hour, and the average time for each call is an exponential r.v. with mean 2 minutes. 


(a) What is the probability that an arriving customer will find the telephone booth occupied? 


(b) It is the policy of the telephone company to install additional booths if customers wait an 
average of 3 or more minutes for the phone. Find the average arrival rate needed to justify a 


second booth. 


(a) The telephone service can be modeled as an М/М/1 queueing system with 4 = 4, р = 4, and p= 


Ми = 2. The probability that an arriving customer will find the telephone occupied is P(L > 0), where 
L is the average number of customers in the system. Thus, from Eq. (9.13), 


PL >»>0)=1—р,=1—(1—р)=р=#=04 
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9.11. 


9.12. 
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(b) From Eq. (9.17), 
4 4 
W, = = ———— > 3 
а щи А) 0.50.5 – A) 





from which we obtain А > 0.3 рег minute. Thus, the required average arrival rate to justify the second 
booth is 18 per hour. 


Derive Eqs. (9.20) and (9.21). 
From Eqs. (9.19) and (9.10), we have 











П à (4) n< (9.46) 
= Toh -13 s . 
Pn Poil (k + 0и Po п) n! 
5-1 À n-i A (5) 1 
= == pog -] Jas n>s 9.47 
Ps Poll acc П su Po nj sist (9.47) 
Let p = A/(su). Then Eqs. (9.46) and (9.47) can be rewritten as 
(sp) Py ons 
n! 
Pr = 
^s 
Р Ро n2s 


Using the summation formula 





{х| < 1 (9.48) 
we obtain Eq. (9.20); that is, 
_ s-i (spy S a - 1 bea (sp) B 
ГАР HLR] OLE m tap 


provided р = A/(su) < 1. 








Consider an M/M/s queueing system. Find the probability that an arriving customer is forced to 
join the queue. 


An arriving customer is forced to join the queue when all servers are busy—that is, when the number 
of customers in the system is equal to or greater than s. Thus, using Eqs. (9.20) and (9.21), we get 


. , = S (spy 
P(a customer is forced to join queue) = Ур, = po a Y p" = ро па) 
a-s Sines 5. —p 
(soy 
= 80) (9.49) 


y (sp)’ + (spy 


до па) 





Equation (9.49) is sometimes referred to as Erlang's delay (or C) formula and denoted by C(s, А/и). Equation 
(9.49) is widely used in telephone systems and gives the probability that no trunk (server) is available for an 
incoming call (arriving customer) in a system of s trunks. 


Derive Eqs. (9.22) and (9.23). 
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Equation (9.21) can be rewritten as 





(sp)" © 
n! Po n S 
р, = 
"s 
P Po п> 5 





Then the average number of customers in the system is 


L = E(N) = In У Je 








Using the summation formulas, 





ye = ers Ix| «1 (9.50) 
k kel _ n 
pr ese" = Ixt<] (9.51) 











and Eq. (9.20), we obtain 
(sey | zl p psp! —(s + Do + 30) 


Lon Y & (pF ( - р)? 
| b ! Gp | nu 4 ples) | 
= Ро)? nao n! + s! (1-- р)? 


p(spy 
"nsu em] 
a 
-+ 





p(spy _ p(spY 
s — prop sil - pp? 


Next, using Eqs. (9.21) and (9.50), the average number of customers in the queue is 


=sp + 


nes 


L,- Y - 3p, = Yn ux 


=s 


= po cer (п —5)р"7* = 








С 








_ Psp _ 
si(1 — py Po 


9.13. A corporate computing center has two computers of the same capacity. The jobs arriving at the 
center are of two types, internal jobs and external jobs. These jobs have Poisson arrival times 
with rates 18 and 15 per hour, respectively. The service time for a job is an exponential r.v. with 
mean 3 minutes. 

(a) Find the average waiting time per job when one computer is used exclusively for internal 
jobs and the other for external jobs. 
(b) Find the average waiting time per job when two computers handle both types of jobs. 


(a) When the computers are used separately, we treat them as two M/M/1 queueing systems. Let W,, and 
W,, be the average waiting time per internal job and per external job, respectively. For internal jobs, 
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9.14. 


9.15. 


QUEUEING THEORY 


A, = 18 = то and и, = 4. Then, from Eq. (9.16), 


For external jobs, 4; = i$ = 4 and џи, = 4, and 
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(b When two computers handle both types of jobs, we model the computing service as an M/M/2 


queueing system with 











_18+15_11 _1 à 3 
7760 02 573 Р 40 
Now, substituting s — 2 in Eqs. (9.20), (9.22), (9.24), and (9.25), we get 
(2p)? | 1-р 
= | 1+ 2р + = + 
Po | ?* X1— p) l+p 
4p? =) 2p 
L=29 + Е {Е} 
pee ley 1—р? 
L | 12 1 
wobo ioi 20 1d 


(9.52) 


(9.53) 


(9.54) 


Thus, from Eq. (9.54), the average waiting time per job when both computers handle both types of jobs 


is given by 


Ww, = _ 2438) Sc = 6.39 min 
1 — (46)°1 


From these results, we see that it is more efficient for both computers to handle both types of jobs. 


Derive Eqs. (9.27) and (9.28). 


From Eqs. (9.26) and (9.10), we have 


iy А 
Pa = (2) Po = P"Po O<n<k 


From Eq. (9.11), po is obtained by equating 


n=0 n=0 
Using the summation formula 
к 1 E xKÉ*! 
Dx Ишти 
we obtain 
1—р (1— pp" 
Pos qp and "TII 


Note that in this case, there is no need to impose the condition that p = A/p < 1. 


Derive Eq. (9.30). 


(9.55) 


(9.56) 
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Using Eqs. (9.28) and (9.51), the average number of customers in the system is given by 


K K 
L-EN)- У np = — eu Ys. 
n=0 p 5-0 
1-р {0[Крк*! —(K + Dof +1] 
"| (1 — р)? | 
1 —(K + 1)of + Kp**! 


(= К = prn 


9.16. Consider the M/M/1/K queueing system. Show that 


L,=L—(1 — po) (9.57) 
1 
ES (9.58) 
A 
1 
=-(L+1) (9.59) 
A 


In the M/M/1/K queueing system, the average number of customers in the system is 


K K 
L-E(N)- Y np, and Ур, = 1 
n=0 л=0 
The average number of customers in the queue is 
K K 


K 

L, = E(N} = Zin- 1)p, = > np, — Ур, = 1 (1 — Po) 
a=1 n=0 л=1 

A customer arriving with the queue in state S, has a wait time T, that is the sum of n independent exponen- 

tial r.v.’s, each with parameter и. The expected value of this sum is n/u [Eq. (4.108)]. Thus, the average 

amount of time that a customer spends waiting in the queue is 


К п К 


1 1 
W, = ET) = = р, == np, = – і. 
‘ * D m u 


Simililarly, the amount of time that a customer spends in the system is 


K K K 
W=ET)= У o, t (Yo Уһ) -10+0 


n=l H =0 n-0 


Note that Eqs. (9.57) to (9.59) are equivalent to Eqs. (9.31) to (9.33) (Prob. 9.27). 


9.17. Derive Eqs. (9.35) and (9.36). 
As in Prob. 9.10, from Eqs. (9.34) and (9.10), we have 








TI А Gy l n< (9.60) 
= = 5 . 
Pa = Poll ky Uu А т 
5-1 A п 1 A Gy 1 
Pr = —_ — = —-] — szEnxk (9.61) 
^П (k + 1) à sa PA] sts 
Let p = A/(sp). Then Eqs. (9.60) and (9.61) can be rewritten as 
(sp) po nes 
n! 
Ps = ps 





| Ро SSnskK 
s! 
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which is Eq. (9.36). From Eq. (9.11), pp is obtained by equating 


(spy K ps 
LS |- 


Кі 


8-1 


K s 
Ур, = ^| У 
п= 0 п= 0 


Using the summation formula (9.56), we obtain 











8-3 (spy ni K . GC ) -1 
AES у 
_ 5 \ (sp) (spy — р“ 2 
die n 5 silo) 


which is Eq. (9.35). 


9.18. Derive Eq. (9.37). 


Using Eq. (9.36) and (9.51), the average number of customers in the queue is given by 


5 K 
Po 7 È (п — 5)р" 
$5 ncs 

















s K- 
Po (ot = sp" * = poy ee Y тр" 
' m=0 
- ip! ЦК apt к cse ta 0 
= Po s! ( — 99 
-p SP ol OK — Mos 
= Po Tq — pj T E + U РХК at) 


9.19, Consider an M/M/s/s queueing system. Find the probability that all servers are busy. 


Setting K = s in Eqs. (9.60) and (9.61), we get 


and p, is obtained by equating 


Thus 


The probability that all servers are busy is given by 
_ AV] 
Ps = Po Р "rim 


(p/s! _ 


PCO 
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(9.62) 


(9.63) 


(9.64) 


Note that in an M/M/s/s queueing system, if an arriving customer finds that all servers are busy, the 
customer will turn away and is lost. In a telephone system with s trunks, p, is the portion of incoming calls 
which will receive a busy signal. Equation (9.64) is often referred to as Erlang's loss (or B) formula and is 


commonly denoted as B(s, А/и). 


9.20. 


An air freight terminal has four loading docks on the main concourse. Any aircraft which arrive 


when all docks are full are diverted to docks on the back concourse. The average aircraft arrival 
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9.21. 


9.22. 


rate is 3 aircraft per hour. The average service time per aircraft is 2 hours on the main concourse 
and 3 hours on the back concourse. 
(a) Find the percentage of the arriving aircraft that are diverted to the back concourse. 


(b) Ifa holding area which can accommodate up to 8 aircraft is added to the main concourse, 
find the percentage of the arriving aircraft that are diverted to the back concourse and the 
expected delay time awaiting service, 


(а) The service system at the main concourse can be modeled as an M/M/s/s queueing system with 5 = 4, 
A=3, p= +, and A/p = 6. The percentage of the arriving aircraft that are diverted to the back con- 


course 1$ 


100 x P(all docks on the main concourse are full) 


From Eq. (9.64), 


64/4! 54 
P(all docks on the main concourse are full) = p, = NUM =— ж 0.47 
А 115 
У, (6"/п!) 
п= 0 


Thus, the percentage of the arriving aircraft that are diverted to the back concourse is about 47 
percent. 

(b) With the addition of a holding area for 8 aircraft, the service system at the main concourse can now be 
modeled as an M/M/s/K queueing system with s = 4, К = 12, and р = A/(sp) = 1.5. Now, from Eqs. 


(9.35) and (9.36), 
3 6" 6^ LE -1 
= -+ x 0.00024 
Po [25+ (ES) 00 


1.51244 
рэ = UM Po © 0.332 





Thus, about 33.2 percent of the arriving aircraft will still be diverted to the back concourse. 
Next, from Eq. (9.37), the average number of aircraft in the queue is 
1.5(6%) 


L, = 0.00024 


Then, from Eq. (9.40), the expected delay time waiting for service is 


Ll . 60565 _ де 3.022 hours 


WwW = = 
7 Ml - р.) 31 — 0.332) 


Note that when the 2-hour service time is added, the total expected processing time at the main 
concourse will be 5.022 hours compared to the 3-hour service time at the back concourse. 


Supplementary Problems 


Customers arrive at the express checkout lane in a supermarket in a Poisson process with a rate of 15 per 
hour. The time to check out a customer is an exponential r.v. with mean of 2 minutes. 


(а) Find the average number of customers present. 
(b) What is the expected idle delay time experienced by a customer? 
(с) What is the expected time for a customer to clear a system? 


Ans. (а) 1; (Б) 2 min; (c) 4 min 


Consider an M/M/1 queueing system. Find the probability of finding at least k customers in the system. 
Ans. р“ = (А/ш). 
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9.24. 


9.25. 


9.26. 


9.27. 


9.28. 


9.29. 


9.30. 
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In a university computer center, 80 jobs an hour are submitted on the average. Assuming that the computer 
service is modeled as an M/M/1 queueing system, what should the service rate be if the average turnaround 
time (time at submission to time of getting job back) is to be less than 10 minutes? 


Ans. 1.43 jobs per minute 


The capacity of a communication line is 2000 bits per second. The line is used to transmit 8-bit characters, 
and the total volume of expected calls for transmission from many devices to be sent on the line is 12,000 
characters per minute. Find (a) the traffic intensity, (b) the average number of characters waiting to be 
transmitted, and (c) the average transmission (including queueing delay) time per character. 


Ans. (a) 0.8; (b) 32; (с) 20 ms 


А bank counter is currently served by two tellers. Customers entering the bank join a single queue and go 
to the next available teller when they reach the head of the line. On the average, the service time for a 
customer is 3 minutes, and 15 customers enter the bank per hour. Assuming that the arrivals process is 
Poisson and the service time is an exponential r.v., find the probability that a customer entering the bank 
will have to wait for service. 


Ans. 0.205 


A post office has three clerks serving at the counter. Customers arrive on the average at the rate of 30 per 
hour, and arriving customers are asked to form a single queue. The average service time for each customer 
is 3 minutes. Assuming that the arrivals process is Poisson and the service time is an exponential r.v., find 
(a) the probability that all the clerks will be busy, (b) the average number of customers in the queue, and (c) 
the average length of time customers have to spend in the post office. 


Ans. (a) 0.237; (b) 0.237; (c) 3.947 min 


Show that Eqs. (9.57) to (9.59) and Eqs. (9.31) to (9.33) are equivalent. 
Hint: Use Eq. (9.29), 


Find the average number of customers L in the M/M/I/K queueing system when 4 = y. 
Ans. K/2 


A gas station has one diesel fuel pump for trucks only and has room for three trucks (including one at the 
pump). On the average trucks arrive at the rate of 4 per hour, and each truck takes 10 minutes to service. 
Assume that the arrivals process is Poisson and the service time is an exponential r.v. 

(a) What is the average time for a truck from entering to leaving the station? 

(b) What is the average time for a truck to wait for service? 

(c) What percentage of the truck traffic is being turned away? 


Ans. (a) 20.15 min; (P) 10.14 min; (с) 12.3 percent 
Consider the air freight terminal service of Prob. 9.20. How many additional docks are needed so that at 


least 80 percent of the arriving aircraft can be served in the main concourse with the addition of holding 
area? 


Ans. 4 


Appendix A 


Normal Distribution 


1 z 
Ф(2) = = | e7? dé 
(2) Da l. 


®(—z) = 1 — (2) 





Fig. A 
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Table A—Continued 














The material below refers to Fig. A. 


a 0.2 0.1 005 0025 001 0.005 
z,; 1282 1645 1960 2240 2576 2807 


Appendix B 


Fourier Transform 


В1 CONTINUOUS-TIME FOURIER TRANSFORM 


Definition: 


Table B-1 Properties of the Continuous-Time Fourier Transform 


x(t) X(o) 

x(t) X (о) 

x(t) X (a) 
Linearity a,x, (t) + a, x(t) a, X,(0) + a, Хдо) 
Time shifting x(t — tg) e 1X (o) 
Frequency shifting eleo x(t) X(@ — оо) 








| 1 
Time scaling x(at) Tal X (s 


Time reversal x(—t) X(—w) 
Duality X(t) 2nx(— o) 
dx(t) 


di jo X(o) 


Time differentiation 


dX 
Frequency differentiation (—jt)x(t) a 


Integration | x(t) dt лХ(0)(о) + x Х(о) 


= 0 
со 


Convolution x(t) ж x(t) = | X4(t)x,(t — т) dt X,(@)X До) 
1 


7 Х (о) * Xo) 
л 


Multiplication x, (t)x2(t) 


-l Г ХХ о — А) dà 
2л J-a 

Real signal x(t) = x(t) + x,(t) Х(о) = А(о) + jB(o) 

X(—«) = X*(o) 

Even component x,(t) Ке{ Х(о)} = Alw) 

Odd component x,(t) j Im(X(o)] = jB(o) 








© 1 eo 
Parseval's theorem | x(t) |? dt = э; | | X(@) |? do 
л -%0 
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Table B-2 Common Continuous-Time Fourier Transform Pairs 


é(t) 
би — to) e іо 
І 2n5(w) 
eieo 225(@ — wo) 
COS Wot т[9(0 — wo) + 0 + Wo)] 
sin Wot —jn[d(w@ — wo) — (0 + wo)] 





1 t>0 1 
«n - 44 («0 m3(w) + = 


e "wt) а> 0 





© 2т 
wo Y Aa — kao), Wo = T 
к= ~ о 








В.2 DISCRETE-TIME FOURIER TRANSFORM 


Definition: 


X(Q) = y x(n)e 7 7" x(n) — = f X(Q)e* dQ 


n=- ою - 


Table B-3 Properties of the Discrete-Time Fourier Transform 


x(n) X(Q) 
хп) X (9) 





x(n) X XQ) 
Periodicity x(n) X(Q + 2л) = X(Q) 
Linearity a,X,(n) + aj x(n) a, X (Q) + a, X (Q) 
Time shifting x(n — no) e f X(Q) 
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Table B-3—Continued 





eon x(n) 


х(—п) 


Frequency shifting 
Time reversal 











Frequency differentiation nx(n) 









п 





Accumulation Y x) 
к= - ж 
Convolution хү(п) ж x(n) = У x,(k)x2(n — k) 
k=-@ 
Multiplication х (и)х,(п) 





Real sequence x(n) = x,(n) + x,(n) 










Even component 
Odd component 


x(n) 
xo(n) 


1 x 
мт = z- | ixene an 






x 


Y 


a= 


Parseval’s theorem 
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ХО — 9) 
X(—Q) 
.4X(Q) 


dQ 


XOD + < XQ) 


—e m 
X (ОХ (О) 


1 
on ХО) & X;(Q) 
л 


1 
al 
X(Q) = AQ) + /8(0) 

X(—Q0) = X*(Q) 
Re{X(Q)} = A(Q) 
j Im{X(Q)} = jB(Q) 


` X (XAQ — 2) dà 


Li 









Table B-4 Common Discrete-Time Fourier Transform Pairs 











































x[n] 
1 п=0 
б(п) = 1 
e) t n #0 
2. ó(n — ng) е9" 
3. х(п) = 1 2nó(Q) 
4. еол 2л (О — Qo) 
5. cos Оп n[6(Q — Qo) + 0 + 9,)] 
6. sin on —jn|à(Q — Qo) — 0 + О„)] 
1 п>0 
. = n ð - 
7 чл) t n «0 (О) + 1—e 8 
1 
8. a’u(n) ja| «1 Ilu 
1 
9. (n+ а u(n) lal < 1 (2 ас Aj 
10 Jal jal <1 Lo d-4 | 
| а 1 — 2а cos Q + а? 
1 [nl sn, sin[ AN, + 3)] 
11. х(п) = at + 
0 |п[ > М, sin(Q/2) 
sin Wn 1 0x|Q| x W 
лп о<<тл җа) - 1 W «|Oj En 
= ы 2л 
У ó(n — KNo) Qo У 90 — КО) Q = —- 
к= – а к= – 0 No 
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Absorbing states, 168 
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Alternative hypothesis, 264 
Aperiodic states, 168 
Arrival parameter, 282 
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Autocorrelation, 162, 210 
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r.v., 44 
Birth-death process, 282 
Bivariate 
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Chebyshev inequality, 66 
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Complex random process, 161-162, 218 
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expectation, 85, 126 
mean, 85, 110 
probability, 7, 24 
probability density function (pdf), 83 
probability mass function (pmf), 83 
variance, 85, 110 
Confidence 
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Consistent estimator, 248 
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integral, 214 
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Counting process, 170 
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matrix, 89 
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Neyman-Pearson, 266, 272 

Decision theory, 264 
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limiting, 169 
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Poisson, 44, 68 


Index 


Index 


Distribution (continued) 
second-order, 162 
stationary, 169 
uniform, 45 

Distribution function, 37, 39, 49 
cumulative (cdf), 37 
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Efficient estimator, 247 
Ensemble, 161 
average, 162 
Equally likely events, 7, 20 
Ergodic, in the mean, 243 
process, 165 
Erlang's, delay (or C) formula, 290 
loss (or B) formula, 294 
Estimates, point, 247 
interval, 247 
Estimation, 247 
Bayes’, 255 
error, 249 
linear, 249 
mean square, 249 
maximum likelihood, 253 
mean square, 249 
parameter, 247 
Estimator, Bayes' 249 
best, 249 
biased, 251 
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efficient, 247 
maximum-likelihood, 248 
minimum mean square error, 249 
point, 247, 250 
unbiased, 247 
Events, 2,9 
certain, 3 
elementary, 2 
equally likely, 7, 20 
impossible, 3 
independent, 8 
mutually exclusive, 6 
and exhaustive, 8 
null, 3 
Expectation, 42, 125 
conditional, 85, 126 
Expected value (see Mean) 
Experiment, Bernoulli, 33 
random, 1 
Exponential, distribution, 46 
r.v., 46 


F 
Fourier series, 216, 236 
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Fourier series (continued) 

Perseval's theorem for, 237 
Fourier transform, 218, 240, 299 
Functions of r.v.'s, 122-124, 129, 136, 144 


G 
Gambler's ruin, 190 
Gamma, function, 59 
r.v., 59, 145 
Gaussian distribution (see Normal distribution) 
Geometric r.v., 55, 62 


H 
Hypergeometric r.v., 76 
Hypothesis 
alternative, 264 
composite, 264 
null, 264 
simple, 264 
Hypothesis testing, 264-275, 268 
level of significance, 265 
power of, 265 


I 
Impulse response, 214 
Independent (statistically) 
events, 8 
increments, 163 
process, 163 
r.v.'s, 80-81, 83 
Interarrival process, 170 
Intersection of sets, 2 
Interval estimate, 247 


J 

Jacobian, 125 

Joint 
characteristic function, 127 
distribution function, 80, 89 
moment-generating function, 126 
probability density function (pdf), 82 
probability mass function (pmf), 82 


K 
Karhunen-Loeve expansion, 217, 231 


L 
Lagrange multiplier, 266 
Laplace r.v., 77 
Law of large numbers, 128, 155 
Level of significance, 265 
Likelihood 

function, 248 

ratio, 265 

test, 260 
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Limiting distribution, 169 
Linear mean-square estimation, 249 
Linear system, 213, 231 
impulse response of, 214 
response to random inputs, 213-215, 231 
Little's formula, 282 
Log-normal r.v., 134 


M 
MAP (maximum a posteriori) test, 266 
Marginal 
distribution function, 81 
cumulative distribution function (cdf), 81 
probability density function (pdf), 82 
probability mass function (pmf), 81 
Markov 
chains, 164 
discrete-parameter, 165-169, 185 
homogeneous, 165 
irreducible, 167 
nonhomogeneous, 165 
regular, 169 
inequality, 66 
matrix, 166 
process, 164, 183 
property, 74, 164 
Maximum likelihood estimator, 248 
Mean, 42 
Mean square 
continuity, 209 
derivative, 209 
error, 249 
minimum, 250 
estimation, 249 
linear, 249 
integral, 210 
periodicity, 216 
Median, 76 
Memoryless property (see Markov property) 
Mercer's theorem, 217 
Minimax (min-max) test, 267 
Minimum probability of error test, 267 
Minimum variance estimator, 248 
Mixed r.v., 41 
Mode, 76 
Moment, 42, 84 
Moment generating function, 126 
Most efficient estimator, 248 
Multinomial 
coefficient, 114 
distribution, 88 
theorem, 114 
trial, 88 
Multiple r.v., 79 





307 


308 Index 


Mutually exclusive 
events, 3, 6, 9 
and exhaustive events, 8 
sets, 3 


N 
Negative binomial r.v., 77 
Neyman-Pearson test, 266, 272 
Normal 
distribution, 47, 297 
bivariate, 88 
n-variate, 88 
process, 164, 184 
r.v., 47 
standard, 47 
Null 
event (set), 3 
hypothesis, 264 
recurrent state, 168 


O 

Orthogonal r.v., 85 
Orthogonality principle, 250 
Outcomes, 1 


Р 
Parameter estimation, 247 
Parameter set, 161 
Parseval's theorem, 237 
Periodic states, 168 
Point estimators, 247, 250 
Point of occurrence, 169 
Poisson, distribution, 44, 68 
process, 169-171 

r.v., 44 

white noise, 230 
Positive recurrent states, 168 
Posterior probability, 266 
Power 

function, 269 

of test, 265 
Power spectral density (or spectrum), 210-213, 225 
Prior probability, 266 
Probability, 1 

density function (pdf), 41 

mass function (pmf), 41 


measure, 5 
Q 
Queueing 

system, 281 


theory, 281 


Index 


R 
Random 
experiment, 1 
process, 161 
complex, 161-162, 218 
independent, 163 
real, 161 
sample, 155, 247 
telegraph signal, 228 
semi, 227 
variable (r.v.), 38 
continuous, 41, 76, 82 
discrete, 41 
function of, 122 
mixed, 41 
vector, 79, 86 
walk, 173 
simple, 173, 183, 195 
Range, 38 


Rayleigh r.v., 59, 143 
Real random process, 161 
Recurrent states, 167 

null, 168 

positive, 168 
Regression line, 250 
Rejection region, 264 
Relative frequency, 5 
Renewal process, 170 


S 
Sample 
function, 161 
mean, 128, 155 
point, 1 
space, 1 
variance, 262 
vector (see Random sample) 
Sets, 1 
algebra of, 2-5, 12 
disjoint, 3 
intersection of, 2 
mutually exclusive, 3 
union of, 2 
Simple 
hypothesis, 264 
random walk, 173, 183, 195 
Standard 
deviation, 43 
normal r.v., 47 
State space, 161 
States 
absorbing, 168 
accessible, 167 
aperiodic, 168 
periodic, 168 
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States (continued) 
recurrent, 167 
null, 168 
positive, 168 
transient, 168 
Stationary 
distributions, 169 
independent increments, 163 
processes, 163 
strict sense, 163 
wide sense (WSS), 163 
transition probability, 165 
Statistic, 247 
sufficient, 277 
Stochastic, continuity, 209 
derivative, 209 
integral, 210 
matrix, (see Markov matrix) 
periodicity, 216 
process, (see Random process) 
System, linear, 213 
linear time invariance (ТТТ), 213-216 
response to random inputs, 213-216 
parallel, 33 
series, 12 


T 

Threshold value, 266 

Time-average, 165 

Time autocorrelation function, 165 

Total probability, 8 

Traffic intensity, 283-284 

Transient states, 168 

Transition probability, 165 
matrix, 165 
stationary, 165 

Type I error, 264 

Type II error, 264 


U 

Unbiased estimator, 247 

Uncorrelated r.v.'s, 85 

Uniform, distribution, 45 
r.v., 45 

Union of sets, 2 

Unit, impulse function (see Dirac d function) 
impulse sequence, 213 
sample response, 214 
sample sequence, 213 

Universal set, 1 


V 

Variance, 42 
conditional, 85 

Vector mean, 89 


Index 


Index 
Venn diagram, 3 


үү 
Waiting time, 202 
White noise, 213, 229 
normal (or gaussian), 229 
Poisson, 230 
Wiener-Khinchin relations, 211 
Wiener process, 172, 204 
standard, 172 
with drift coefficient, 172 
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